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ORGANIZER, GENERAL BACKGROUND, AND 
ANTECEDENT LEARNING VARIABLES IN 
SEQUENTIAL VERBAL LEARNING 


DAVID P. AUSUBEL axnp DONALD FITZGERALD 


Bureau of Educational Research 
University of Illinois 


An analysis of covariance design was used with undergraduate students 
in studying the effects of an advance organizer, antecedent learning, 
and general background knowledge on the learning and retention of 
2 unfamiliar sequential passages about endocrinology. The organizer, by 
providing relevant ideational anchorage, suggestively enhanced the 
learning of the material for those Ss with relatively poor verbal ability. 
Knowledge of the Ist passage constituted a statistically significant limit- 
ing condition in learning the 2nd passage when the influence of both 
verbal ability and general endocrinological knowledge was statistically 
controlled. Finally, general background knowledge in endocrinology 
significantly facilitated the learning and retention of the Ist passage, 


presumably by increasing the familiarity of the new material. 


Typically in the course of meaningful 
school learning, the learner is introduced 
to new materials which are variants of 
and hence relatable to previously learned 
concepts already established in his cog- 
nitive structure. Ausubel and Fitzgerald 
(1961) have shown that under these 
circumstances the learner’s ability to dis- 
criminate between the two sets of con- 
cepts has important implications for 
learning and retention. They demon- 
strated that such discriminability is 
partly a function of the clarity and sta- 
bility of the previously learned concepts, 
and that when discriminability is low 
because of inadequate prior knowledge, 
learning and retention can be enhanced 
by the use of “comparative organizers.” 

A somewhat different learning situa- 
tion, however, not uncommonly arises 
when, for various reasons, students are 
required to learn a new unit of sequen- 
tially organized material which cannot 
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be related to previously learned concepts. 
No specific referents, either explicit or 
implicit, are therefore available for this 
material in cognitive structure. Organ- 
izers, under these conditions, obviously 
cannot be used to increase discriminabil- 
ity; they can at best furnish ideational 
anchorage. 

This type of learning situation also 
provides an excellent model 2nd test of 
the transfer of training paradigm in 
sequential meaningful learning. When 
subject matter is organized sequentially 
(so that Part II is built upon and presup- 
poses knowledge of Part I) the student’s 
grasp of the earlier appearing ma- 
terial becomes a crucial factor influenc- 
ing his learning of the later appearing 
material. When materials are sequen- 
tially presented, therefore, the residue of 
the earlier appearing material in cogni- 
tive structure—i.e., its clarity, stability, 
and organization—constitutes the impact 





244 Davip P. AusuBEL AND DONALD FITZGERALD 


of the “antecedent condition” in the 
transfer of training paradigm. It be- 
comes in effect the anchoring post or 
ideational scaffolding in cognitive struc- 
ture for the learning of the later appear- 
ing material. If this ideational scaffold- 
ing is clear, stable, and well organized, 
it is reasonable to suppose that it pro- 
vides better anchorage for new learning 
and retention than if it is unclear, un- 
stable, and poorly organized. 

In addition to furnishing a theoretical 
model for sequential meaningful learning 
in terms of the familiar transfer of train- 
ing paradigm, an analysis of covariance 
design utilizing this type of learning 
situation makes possible a more explicit 
test of the proposition that the quality 
of Part I learning actually influences 
Part II learning outcomes when subject 
matter is sequentially organized. Typi- 
cally this proposition is assumed to be 
axiomatically true in the absence of 
proof because of the difficulty of con- 
trolling such significant variables as rela- 
tive learning ability (which accounts for 
much of the positive relationship be- 
tween Part I and Part II learning out- 
comes), and the availability in cognitive 
structure of relevant concepts relatable 
to and influencing the learning of Part II 
materials which were introduced prior to 
the Part I materials. In this study the 
influence of learning ability is held con- 
stant by statistical means, and the con- 
taminating influence of antecedent con- 
ditions prior to those established by the 
Part I materials is eliminated both by the 
use of wholly unfamiliar materials and 
by statistical control of general back- 
ground knowledge. 

This type of research design also 
makes it possible to test the effect of 
general background knowledge in the 
same subject matter area on the learning 
of explicitly unfamiliar new material. 
It is conceivable that the general level of 
sophistication can affect the learning of 
new sequential material—not by in- 


fluencing its discriminability from or 
relatability to cognitive structure, but 
more indirectly by increasing its famili- 
arity and hence the learner’s feeling of 
self-confidence in handling it. 

It is hypothesized, first, that the ex- 
pository organizer differentially enhances 
the learning and retention of completely 
unfamiliar material for those subjects in 
the experimental group who have less 
verbal ability, and who therefore tend 
spontaneously to structure new verbal 
material less efficiently and successfully. 
Second, it is hypothesized that as a 
result of the enhanced learning of Part I 
materials achieved by some members of 
the experimental group with the aid of 
the organizer in the sequential learning 
situation, the experimental group as a 
whole manifests superior learning and 
retention of Part II materials. Third, 
it is hypothesized that there is a signifi- 
cantly positive relationship between 
Part I and Part II retention scores in 
both experimental and control treatment 
groups, and that this relationship re- 
mains significantly positive even when 
the influence of learning ability and 
general background knowledge is elimi- 
nated. Finally, it is hypothesized that 
general background knowledge increases 
the learning and retention of Part I 
materials. 


METHOD 


Subjects 


The experimental population consisted of 
predominantly senior undergraduate students 
(91 women and 52 men) in six sections of an 
educational psychology course at the University 
of Illinois. All subjects were enrolled in 1 of 10 
teacher education curricula at the secondary 
school level. The experiment was conducted 
separately in each section as a required labora- 
tory exercise, and was performed during regu- 
larly scheduled class hours. In order to maxi- 
mize ego involvement, subjects were informed 
that after the data were processed their in- 
dividual scores, as well as the class results, 
would be reported to them. 
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Learning Passages, Organizers, and 
Measuring Instruments 


The learning material used in this study con- 
sisted of two specially prepared, sequentially 
related passages dealing with the endocrinology 
of pubescence. The first passage (approximately 
1,400 words) was concerned with the specific 
hormonal factors initiating and regulating pu- 
bescence. The second passage (approximately 
1,600 words) considered pathological variations 
in pubescence and their treatment. 

This particular topic was selected because it 
was anticipated that it would be almost com- 
pletely unfamiliar to undergraduate students in 
teacher education. Inasmuch as it dealt with 
new and rather esoteric ideas at the frontiers of 
medical physiology, we were confident that 
nothing in the previous academic background 
of our subjects had left a residue of established 
concepts in cognitive structure to which this 
material was specifically relatable. It was im- 
portant to satisfy this criterion for two reasons: 
First, since we were endeavoring to make the 
meaningfulness and comprehensibility of the 
second passage as dependent as possible upon 
knowledge of the first passage, it was necessary 
to make sure that the later appearing passage in 
the sequence was not relatable to existing con- 
cepts of other origin within cognitive structure 
(ie., of origin other than the earlier appearing 
passage). Second, since the organizer in this 
experiment, unlike the organizer in another re- 
cent study (Ausubel & Fitzgerald 1961), was 
designed merely to provide ideational anchor- 
age for the first passage, and not to increase its 
discriminability from analogous concepts within 
the subjects’ cognitive structure, it was neces- 
sary to avoid any possibility of relatedness be- 
tween the learning passage and established con- 
cepts. The use of unfamiliar learning material 
also made it possible for all subjects to start 
from approximately the same base line in learn- 
ing the two passages. 

Empirical confirmation of the unfamiliarity 
of the pubescence material was obtained when 
a group of comparable naive subjects, who had 
not studied the learning passages, made scores 
which, on the average, were only slightly and 
not significantly better than chance. 

Knowledge of the material in each pu- 
besence passage was tested by a 36-item multi- 
ple-choice test. The corrected split-half relia- 
bilities of the two tests were 83 and .74, 
respectively, and the correlation between them 
was .51 for the total group of subjects. Scores 
on both tests showed a satisfactory range of 
variability, and their distribution did not 
deviate significantly from the normal distribu- 


tion curve. Since they were intended as power 
tests, no time limit was imposed. 

The experimental group studied on two 
separate occasions a 500-word introductory 
passage (expository organizer) which provided 
an organizational framework for the first pas- 
sage in terms of the different kinds of uniform- 
ity and variability prevailing among the prim- 
ary and secondary sex characteristics. It was 
pitched at a much higher level of abstraction, 
generality, and inclusiveness than the pubes- 
cence material itself. The control group also 
studied on two separate occasions a 500-word 
introductory passage dealing with uniformity 
and variability among different cultures in be- 
havioral aspects of pubescence. This was in- 
tended solely as a control treatment and had 
no organizational properties whatsoever in rela- 
tion to the first learning passage. No informa- 
tion was included in either of the introductory 
passages that could constitute a direct ad- 
vantage in answering questions on the pubes- 
cence material. It was methodologically im- 
portant to provide a control (nonorganizing) 
introduction for the control group in order that 
any obtained differences in retention outcomes 
between the experimental and control groups 
be attributed to the organizing properties of 
the experimental introduction rather than to its 
presence per se (i.e., the Hawthorne effect). 

To test the effect of general background 
knowledge on the learning of the pubescence 
material a 20-item multiple-choice General 
Endocrinology Test (GET) was used. The dis- 
tribution of scores on this test did not deviate 
significantly from the normal distribution 
curve, and the corrected split-half reliability 
was 88. As a power test, this test also had no 
time limit. 

Scores on the verbal portion of the School 
and College Ability Test (SCAT), reflective of 
verbal ability, were available for all subjects 


Procedure 


On the first day of experimentation, all sub- 
jects first took the GET and then (after assign- 
ment to a treatment group) studied either the 
organizer or control introduction for 8 minutes. 
Membership in a given treatment group was 
determined by random assignment. The popu- 
lation of each of the three treatment groups 
was also stratified by sex so that the proportion 
of men to women subjects would be the same 
in each group. This was necessary because of 
the women’s significantly higher verbal SCAT 
scores. It was possible to administer the two 
treatments simultaneously because they con- 
sisted of identical appearing introductory pas- 
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sages (with identical sets of directions) differ- 
ing only in content. 

The principle of random assignment to treat- 
ment groups was relied upon to equate experi- 
mental and control groups with respect to rela- 
tive learning ability and background knowledge 
in the pubescence material. To further equate 
the treatment groups with respect to the latter 
variable, the relatively few biology majors in 
the group were identified and divided equally 
among the experimental and control groups. In 
any case, however, the influence of background 
knowledge was minimized by the esoteric 
nature of the material and its almost complete 
unfamiliarity to naive subjects (see above). 

In order to control for the effects of class- 
room climate, instructor, and situational varia- 
bles in the six sections, students within each 
section were equally divided among the two 
treatment groups. Since homogeneity of vari- 
ance prevailed, both between sections and treat- 
ment groups, for both the pubescence test 
scores and the GET scores, it was considered 
justifiable to pool the scores from the various 
sections and to treat the scores of the two 
treatment groups on each of these instruments 
as comparable random samples from the same 
large population. 

Two days after the first experimental session, 
both treatment groups restudied their respective 
introductory passages for 6 minutes and then 
studied the first pubescence passage for 25 
minutes. The test on the latter passage came 
48 hours later. The second pubescence learning 
passage was administered 3 days after the test 
on the first passage. Subjects were given 27 
minutes to read and study the second passage 
and were tested on it 4 days later. 


RESULTS AND DISCUSSION 
Effect of Organizer on Initial Learning 


Comparison of the means of the total 
experimental and control groups (Table 
1) shows that the organizer facilitated 
the learning and retention of the first 
pubescence passage (p=.07). Analysis 
of covariance similarly indicates that the 
overall effect of the organizer on reten- 
tion scores was not quite significant at 
the .05 level (Table 2). As hypothesized, 
however, practically all of the obtained 
near significant difference between ex- 
perimental and control groups was de- 
rived from the subjects in the lower third 


TABLE 1 


ApJUSTED MEAN RETENTION Scores OF EXPERI- 
MENTAL AND ContTRoL Groups OF First 
PUBESCENCE PassaGE ANALYZED BY 
VERBAL Asitity LEVELS 


Verbal ability 
(SCAT) 
Upper third 20.9 21.1 
Middle third 19.1 18.5 
Lower third 18.3 15.0 

Total 19.4 18.2 


Treatment group 


Organizer Control 


of the distribution of verbal ability 
scores. Experimental subjects in the 
lower third subgroup of SCAT scores 
learned and retained significantly more 
material than did control subjects in 
the lower third subgroup (p < .01), 
whereas differences between experimen- 
tal and control subjects in the upper 
third and middle third subgroups were 
negligible. 

The differential facilitating effect of 
the organizer on the learning and reten- 
tion of the first pubescence passage for 
the subjects with relatively low verbal 
ability is also revealed by the organizer’s 
leveling influence on the significantly 
positive relationship between SCAT and 
retention scores (Tables 2 and 3). The 


TABLE 2 


CovVARIANCE ANALYSIS OF First PUBESCENCE 
Test Scores X SCAT Levets, ApyustInc 


Adjusted 
df MS 


1 
2 
2 
5 
5 


Source 


Treatment 
SCAT level 
Interaction 
Between 
Within 


_ 
i) 


* 01 level of significance. 
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TABLE 3 


ParRTIAL CORRELATIONS BETWEEN SCAT, GET, 
PuBESCENCE Test I, AND PUBESCENCE 
Test 2 


Held Treatment group 


Correlation constant Organizer Control 
SCAT 31* 11 
GET 50** 
SCAT 33 14 
Test I 2 13 
GET 43** 


GET X Test I 
SCAT X Test I 
GET & Test II 
GET & Test Il 
SCAT X Test II 
SCAT X Test I 
Test 1 < Test II 
Test 1 X Test II 


Test I f 22 
SCAT 
GET 


47** 
57** 


* OS level of significance. 
** O1 level of significance. 


partial correlation between SCAT and 
retention scores, with GET scores held 
constant, was .50 (p < .01) for the con- 
trol group but only .19 for the experi- 
mental group (Table 3). Thus by differ- 
entially raising the retention scores of 
the lower ability subjects, the organizer 
reduced the intrinsic relationship be- 
tween SCAT and retention scores. 
Suggestive evidence therefore exists 
for believing that organizers, by provid- 
ing ideational anchorage, facilitate the 
learning and retention of totally unfa- 
miliar material for those subjects who 
have relatively little verbal ability. Sub- 
jects of average and better ability are 
evidently capable of spontaneously or- 
ganizing new learning material around 
relevant, more inclusive concepts, and 
hence derive little or no benefit from in- 
troduced advance organizers. This kind 
of interaction between verbal ability and 
the effect of organizers did not occur in 
a previous study where the learning 
material (Buddhism) was more ex- 
plicitly relatable to previously learned 
concepts (Christianity) in cognitive 
structure (Ausubel & Fitzgerald, 1961). 
It is possible that the potential facili- 
tating impact of the organizer on the 


learning and retention of the first pu- 
bescence passage was not fully realized 
in this study because subjects, due to 
inadequate background knowledge in the 
immediate subject matter area, experi- 
enced difficulty in relating the organizer 
to existing cognitive structure, and hence 
in learning it. Obviously, if the organizer 
itself cannot be firmly anchored to cog- 
nitive structure, it cannot serve as a 
very effective anchoring post. If possi- 
ble, therefore, in preparing organizers 
for an unfamiliar field of knowledge, an 
effort should be made to formulate the 
organizing principles in terms of con- 
cepts that are already familiar to the 
learner and established in his cognitive 
structure. The more unfamiliar and 
esoteric a given learning task is, how- 
ever, the more difficult it is to implement 
this precept. 


Effect of Organizers on Sequential 
Learning 


It had been hypothesized that the 
organizer, by virtue of facilitating re- 
tention scores on Test I, would thereby 
enhance the learning and retention of the 
second pubescence passage. Comparison 
of the means of the total organizer and 
control groups (Table 4), as well as 
reference to the covariance analysis of 
second pubescence test scores by Test I 
scores and treatment, holding either 


TABLE 4 


ApyusTEep Mean Retention Scores or Expeni- 
MENTAL AND Controt Groups on SEconp 
PUBESCENCE PassaGe ANALYZED BY 
Pusescence Test I Levers 


Pubescence Treatment group 


Test I scores 
Upper third 23 24.0 
Middle third 19. 21.0 
Lower third 19.2 18.1 

Total 20.5 21.0 


Organizer Control 
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TABLE 5 


COVARIANCE ANALYSIS OF SECOND PUBESCENCE 
Test Scores X Test I Levers, Apyustinc 
ror GET Scores 


x Adjusted 


Source 


Treatment 
Test I levels 
Interaction 
Between 
Within 
* .O1 levei of significance. 
** 001 level of significance. 

GET (Table 5) or SCAT (Table 6) 
constant, show that the hypothesis was 
not confirmed. This finding, of course, 
was hardly unexpected in view of the 
relatively slight and only suggestive fa- 
cilitating effect of the organizer on initial 
learning. Nevertheless, the significant 
partial correlations in the experimental 
group both between GET and Test IT 
scores (p < .01), with the influence of 
SCAT eliminated, and between GET 
and Test II scores (p < .05), with the 
influence of Test I eliminated, when con- 
trasted with the nonsignificant corre- 
sponding correlations in the control 
group (Table 3), suggest the occurrence 
of a positive interaction between the 
effects of the organizer and of general 
background knowledge. The organizer 
apparently helped the subjects utilize 


TABLE 6 


COVARIANCE ANALYSIS OF SECOND PUBESCENCE 
Test Scores X Test I Levers, ApyusTINc 
ror SCAT Scores 


s ; Adjusted 
ource dj MS 


Treatment 
Test I levels 
Interaction 
Between 
Within 


* .O1 level of significance. 


their existing background knowledge 
more effectively in structuring the second 
pubescence passage. 

In order for organizers to be really 
effective in enhancing sequential learning 
and retention, it would probably be 
necessary to use another organizer prior 
to the second passage. The second or- 
ganizer would attempt both to provide 
specific ideational scaffolding for the sec- 
ond passage, and also to increase dis- 
criminability between confusable con- 
cepts in the two passages. 


Effect of Initial Learning on Sequential 
Learning 


As hypothesized, Part I retention 
scores (initial learning) had a significant 
facilitating effect on Part II retention 
scores (sequential learning). Both ex- 
perimental and control subjects in the 
upper part of the distribution of scores 
on the first pubescence test made higher 
retention scores on the second pubes- 
cence test than did subjects in the lower 
part of the distribution of first pubes- 
cence test scores. This facilitating effect 
of antecedent learning (p < .001; p 
< .01) was manifested when either 
the influence of general background 
knowledge (Table 5) or of verbal 
ability (Table 6) was statistically 
controlled in a covariance design. Sta- 
bility and clarity of knowledge of the 
first passage therefore constituted a 
significant limiting condition with re- 
spect to learning and retaining the ma- 
terial in the second passage. 

Sequential organization of subject 
matter, therefore, can be very effective 
because each new increment of knowl- 
edge serves as an anchoring post for 
subsequent learning. This presupposes, 
of course, that the preceding step is 
always clear, stable, and well organized. 
If it is not, the learning of all subsequent 
steps is jeopardized. Hence new material 
in the sequence should never be intro- 
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duced until all previous steps are thor- 
oughly mastered. Perhaps the chief 
pedagogic advantage of the teaching 
machine lies in its ability to control this 
crucial variable in sequential learning. 


Effect of General Background Knowl- 
edge on Initial Learning 


The data clearly support the hypothe- 
sis that general background knoweldge 
in endocrinology facilitates the learning 
of unfamiliar material in the same sub- 
ject matter field (i.e., pubescence). This 
facilitating effect was significant at the 
.OS level when verbal ability was sta- 
tistically controlled in a covariance de- 
sign (Table 7). Significant interaction 
(p < .03) also occurred between the 
organizer and general endocrinology 
knowledge. Reference to Table 8 shows 
that the interactional effect was positive 
for the subjects in the upper third and 
lower third subgroups of the distribution 
of GET scores and negative for subjects 
in the middle third subgroup. The over- 
all interactional effect, however, was 
positive as shown by the contrast be- 
tween the significant partial correlation 
(p < .05) in the experimental group be- 


TABLE 7 


COVARIANCE ANALYSIS OF SECOND PUBESCENCE 
Test Scores X GET Levers Apyustinc 
ror SCAT Scores 


Adjusted 


Source 


Treatment 
GET level 
Interaction 
Between 5 
Within 131 
* 05 level of significance 
** O1 level of significance 


TABLE 8 


ApyUsTeD MEAN RETENTION Scores oF ExPeER!- 
MENTAL AND ContTroL Groups ON First 
PUBESCENCE PassAGE ANALYZED BY 
GET Levets 


General 
endocrinology 
knowledge 


Treatment group 
Organizer Control 
Upper third 21.1 183 
Middle third 18.4 208 
Lower third 18.6 16.1 

Total 19.4 18.4 


tween GET and intial retention scores, 
with SCAT held constant, and the non- 
significant corresponding correlation in 
the control group (Table 3). Similar 
positive correlations were reported above 
in relation to Test I scores. 

General endocrinology knowledge was 
too distantly related to the pubescence 
material to provide televant ideational 
anchorage. For the most part, it prob- 
ably facilitated the learning and reten- 
tion of the pubescence passage by in- 
creasing its general familiarity, and 
hence the subjects’ self-confidence in 
handling it. However, the positive inter- 
action between the effects of the or- 
ganizer and of general background 
knowledge in endocrinology suggests that 
the organizer better enables the subjects 
to put their background knowledge to 
effective use in structuring the unfa- 
miliar new material. This finding was 
consistent for both initial and sequen- 
tial learning. 
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THE EFFECT OF INTERMITTENT CONFIRMA- 
TION IN PROGRAMED INSTRUCTION ®* 


RONALD G. WEISMAN 
Michigan State University 


JOHN D. KRUMBOLTZ 
Stanford University 


AND 


The effect of intermittent confirmation was tested on 121 students by 
omitting various patterns of confirming answers from a programed 
textbook on educational measurement. The schedules included 4 levels of 
fixed-ratio confirmation and 2 of variable-ratio confirmation. Results 
based on criterion measures consisting of (a) errors made on the program 
and (b) performance on a posttest indicated: a negative linear relation- 
ship between the number of errors made on the program and the per- 
centage of confirmation provided, no significant effects on the posttest 
from the various proportions of confirmation, and no evidence of dif- 
ferential effect between fixed-ratio and variable-ratio confirmation on 


either criterion. 


Intermittent reinforcement has the 
well known effect of prolonging extinc- 
tion responding in simple response se- 
quences (Lewis, 1960). Experimental 
tests of the effects of intermittent re- 
inforcement in complex, meaningful 


learning are difficult to devise because 
of the problem of controlling the “rein- 


forcement.”’ Several recent attempts to 
devise such experiments have been re- 
viewed by Krumboltz (1961). 

In programed instruction knowledge 
of results may be considered a form of 
reinforcement. In many programed text- 
books the correct answer to a frame is 
placed on the page following the frame. 
A student can confirm each of his 
answers by comparing it with the cor- 
rect answer on the following page. Omit- 
ting certain of these answers provides 
an approach for investigating the effects 
of various schedules of confirmation. 

This approach cannot provide a com- 
pletely adequate way to assess the ef- 
fects of partial reinforcement because 
there are certainly other less explicit 
reinforcers in most programs. Occa- 
sionally the correct answer is stated or 


1This research was supported by a grant 
from the United States Office of Education, 
Educational Media Branch, under Title VIT of 
the National Defense Education Act. 
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implied in succeeding frames. Or if the 
preceding explanation has been ade- 
quate, a student may report that he knew 
he was correct even in the absence of the 
confirming answer (and sometimes even 
when a typographical error has made the 
confirming answer incorrect). Neverthe- 
less, the most prominent reinforcer is 
the knowledge of results provided by the 
confirming answer. Providing confirming 
answei_ on an intermittent schedule dur- 
ing learning of programed material may 
increase retention in the same manner 
that intermittent reinforcement prolongs 
extinction responding in less complex 
learning tasks. 


METHOD 


Subjects 

The subjects were drawn from 140 students 
enrolled in four sections of an introductory 
course in educational psychology during the 
summer of 1961. Due to absences the final 
sample size was reduced to 121. Within each 
section, the subjects were randomly assigned to 
each of the six reinforcement schedules. 


Materials 


The 177-frame programed textbook in this 
experiment was designed to teach prospective 
teachers how to interpret educational test re- 
sults. It covered material on percentiles, norms, 
means and medians, standard deviations, and 
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scores. The linear constructed-response pro- 
gram was not intended to develop computa- 
tional proficiency but only to develop a useful 
understanding. The program had previously 
been administered to 80 students and revised in 
accordance with observed error rates and per- 
formance on a criterion test 

The 50-item completion-type criterion test 
was also revised after an initial administration 
in accordance with an item analysis and 
changes in the program itself. Its estimated 
corrected split-half reliability was found to 
be 85. 


Procedure 


The same basic program was used with all six 
treatment groups. The only differences were in 
the number and order of confirming answers 
provided by the programed textbook. The six 
variations were as follows: 


1. Continuous confirmation answers 
every response were provided 

2. Fixed-ratio 679% confirmation 
firming answer to every third 
omitted. 

3. Variable-ratio 67% confirmation: on the 
average only two-thirds of the frames at ir- 
regular intervals were followed by confirming 
answers. 

4. Fixed-ratio 33% confirmation: the con- 
firming answer to every third frame only was 
included 

5. Variable-ratio 33% confirmation: on the 
average one-third of the frames at irregular 
intervals were followed by confirming answers 

6. No confirmation: all answers were elimi- 
nated. 

The above schedules were applied beginning 
at the eleventh frame in the 177-frame pro- 
gram. Each of the first 10 frames under all six 


the con 
Irame was 
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schedules was followed by confirming answers 
so that students would learn the usual pro- 
cedure for using programed textbooks. 

To prevent generalizing from just one specific 
set of omitted answers, the particular answers 
omitted were varied in both the fixed-ratio and 
variable-ratio booklets. One-third of the fixed 
ratio booklets were constructed to coincide with 
each of the three possible patterns of omitted 
answers at each of the two reinforcement rates 
A table of random numbers was used to prepare 
10 different versions of the variable-ratio 
schedule at each confirmation rate. Modifica- 
tions in the program were accomplished by 
inking out the designated answers with felt-tip 
pens. 

Booklets representing each of the six sched 
ules were arranged in random order and handed 
out to members of each class. Students were 
given sufficient class time (4 hours) to complete 
all 177 frames in class. All students were in 
structed to record their answers on a separate 
sheet of paper provided by the experimenters 

As soon as a student had completed the pro- 
gramed textbook, he was given the 50-item 
completion-type criterion test. Two criterion 
measures were analyzed: (a) number of errors 
made on the program itself, and (b) number 
of correct answers on the criterion test 


RESULTS 


A summary of the results appears in 


Table 1. Various tests of hypotheses 
based on these data were made to 
answer the following questions: 

1. Ignoring the variable-ratio sched- 


ules for the moment, do the remaining 
four schedules (all of which may be 


TABLE 1 


Sum™mary or Resutts ror Eacu ScuepuLe or CONFIRMATION 


Number of errors it 
me program 


SD M 


Schedule of 
confirmation 


177-fr 


M 


Continuous 
Fixed-ratio 67% 
Variable-ratio 67% 
Fixed-ratio 33% 
Variable-ratio 33° 
No 


Correlation 
bet ween 
criterion test 
and grade 
point average 


Score on 
criterion test 
SD 


14 32 62 
12 : 5 10 
33 s 


42 
58 
61 


c*7 
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considered fixed-ratio) cause different 
error rates on the program itself? Sig- 
nificant differences were obtained (F = 
4.61; df=3/81; p < .005). Further- 
more, a test of the linearity of the trend 
over the four degrees of confirmation, 
0%, 33%, 67%, and 100%, revealed a 
significant linear trend (F = 11.68, df= 
1/81; p < .001). That is, there is a 
significant negative relationship between 
the amount of confirmation and the 
number of errors made on the program 
itself; the more confirmation, the fewer 
program errors. 

2. Do these same four schedules pro- 
duce different scores on the criterion 
test? A similar analysis of the criterion 
test scores resulted in a nonsignificant 
F less than 1. Thus, the four degrees 
of confirmation in fixed-ratio schedules 
did not produce criterion test means 
sufficiently different to reject the null 
hypothesis. An attempt was made to 
gain further precision by computing an 
analysis of covariance by adjusting the 
mean criterion test scores for initial dif- 
ferences in students’ all-university grade 
point averages. The covariance analysis 
was abandoned, however, when the as- 
sumption of homogeneous regression was 
not found tenable (F =3.48; df=3/81; 
p < .05). As seen in Table 1, all cor- 
relations were approximately the same 
except for the one slightly negative re- 
lationship in the Fixed-ratio 67% group. 
No logical explanation for this dis- 
crepancy is apparent. 

3. What main and interaction effects 


TABLE 2 


ANALYSIS OF VARIANCE OF Errors ON PROGRAM 


Source df MS 


Percentage of confirma- 
tion 1 
Type of ratio 1 
Interaction 1 
Within 72 


KRUMBOLTZ AND Ronatp G 


WEISMAN 


TABLE 


ANALYSIS OF VARIANCE OF CRITERION TEST 
ScoRES 


Source df 


Percentage of confirma 
tion 

Type of ratio 

Interaction 

Within 


26.30 

15.09 
49.18 
47.29 


result from differences between type of 
ratio and percentage of confirmation? 
Here the Continuous confirmation and 
No confirmation groups were ignored, 
and a 2X2 analysis of variance of the 
other four groups was calculated on the 
program errors (Table 2) and the cri- 
terion test (Table 3). Differ- 
ences between fixed-ratio and variable- 
ratio schedules produced nonsignificant 
differences on both criteria as well as 
on the interactions with the percentage 
of confirmation. No evidence was ob- 
tained with the criteria used here on 
the differential effectiveness of fixed- 
ratio and variable-ratio schedules of con- 
firmation in programed instruction. 


scores 


DISCUSSION 


The results of the present study did 
include one effect common to intermit- 
tent reinforcement: an increase in error 
rate during learning under the higher 
ratio scredules. Groups receiving fewer 
confirmations made more errors during 
learning although the difference in error 
rate was relatively small indicating that 
considerable confirmation was probably 
present within the frames themselves. 

Thus, while it is clear that confirma- 
tion had some effect on the error rate, 
it is also clear that sequencing and cue- 
ing within the program itself were impor- 
tant. It is hypothesized that the effects 
of various schedules should become in- 
creasingly evident as more of the inter- 
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nal cueing is removed from the program. 
If confirmed, such a hypothesis would 
be consistent with some evidence sup- 
porting the hypothesis of Evans, Glaser, 
and Homme (1960) that the relevance 
of variables such as response mode and 
immediacy of confirmation is inversely 
related to the probability of correct re- 
sponding. For example, Goldbeck 
(1960) discovered an interaction be- 
tween response mode and the difficulty 
level of a “discrete discourse’ program. 
See Krumboltz (1961)-.for a discussion 
of the related evidence. 

The results of the criterion test did 
not give evidence that intermittent con- 
firmation improved retention. It is pos- 
sible, however, that an immediate re- 
tention test may not be sensitive to 
differences in schedules. Krumboltz and 
Weisman (1962) found that a retention 
test delayed 2 weeks was more sensi- 
tive to the effect of overt responding. 
This would argue that future experi- 
ments should include delayed measures 
of retention as well as immediate cri- 
teria. 

In any future experiment along this 
line additional precision could be ob- 
tained by arranging the schedules of 


confirmation separately for each con- 
cept. Many concepts and their inter- 
relationships were taught in the present 
program, and the schedule of confirma- 
tions ignored the particular concept em- 
phasized in each frame. Programs could 
be arranged so that various schedules 
would be tested for each major concept 
regardless of that concept’s order in 
total program. 
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CROSS-CULTURAL STUDIES OF TEENAGERS’ 
PROBLEMS * 


H. H. REMMERS 


Purdue University 


This paper reports findings from representative stratified samples of more 
than 5000 teenagers in school in the United States, Puerto Rico, West 
Germany, and India who responded to the Science Research Associates 
(SRA) Youth Inventory, a problems check list adapted to each of the 
cultures surveyed. Mean scores, reliability estimates, intercorrelations of 
subscale scores, and factor analyses of these matrices all lead to the con- 
clusion that (a) the measuring instrument is highly reliable, (b) teen- 
agers’ self-perceived problems can be comparably measured across widely 
different cultures, (c) rankings of problem areas across cultures are 
highly similar, (d) health problems are of least concern and post-high- 
school problems of most concern. The amount and intensity of worry 


varies greatly across cultures. 

Science and technology have in recent 
years tremendously shrunk the size of 
the earth in terms of time and space as 
these philosophical ultimates are re- 
lated to intercultural contacts. The dis- 
tance in travel time from San Francisco 
to Moscow is no greater than was the 
distance from, say, Chicago to St. Louis 


only four or five decades ago. The most 
distant peoples spatially have become 
near neighbors. Electronics has reduced 
communication to the farthest reaches 
of the earth to a matter of minutes; 
and for neighborhoods to live together 


peacefully in this rapidly shrinking 
world they need to understand each 
other—their cultures, their values, their 
problems. 

The cross-cultural studies of this r - 
port may contribute to increased un- 
derstanding of that segment of the 
population—the teenagers in school— 
whose socialization within their respec- 
tive cultures is going on apace and al- 
ready largely completed. From among 
them will come, within a short time, the 
opinion leaders of their communities. 


1 Given at the Atlantic City meeting of the 
American Educational Research Association, 
February 21, 1962. 


It is important to note that in the 
United States, at least, the teenagers 
have largely interiorized the attitudes 
and values of the older generation, their 
parents, who constitute the major por- 
tion of the adult population. The teen- 
agers’ political preferences quite pre- 
cisely predict the popular adult vote in 
presidential elections (Franklin & Rem- 
mers, 1960; Remmers, Baker, Fitzhugh, 
& Heath, 1956; Remmers, Horton, & 
Mainer, 1952). 

Agreement of parents and their teen- 
age children on a variety of attitudes 
has been shown to be high (Remmers, 
1959). Even in matters where literary 
myth would have violent disagreement 
between teenagers and their parents— 
when teenagers should be home from a 
party, their use of the family car, proper 
behavior on dates, their own appropri 
ate punishment for misbehavior, and 
many other issues—careful empirical 
research has shown much more agree- 
ment than disagreement between teen- 
agers and their parents (Hackett, 1951). 
In brief, the American teenager’s mold 
is apparently largely set. His attitudes 
or values posidict those of the preced- 
ing generation; and one might expect 
that this generalization is no less and 
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probably even more true of less open 
and permissive societies than ours. 


Source OF DATA 


The studies reported here are based 
on the findings of four doctoral re- 
searches done under the author’s super- 
vision and one independent heretofore 
unpublished study carried out in West 
Germany. All represent studies by 
means of translations and adaptations 
of the Science Research Associates 
(SRA) Youth Inventory, Form S* (de 
Lopategui, 1957; Kulkarni, 1961; Rem- 
mers & Shimberg, 1960; Shimberg, 
1949). 

Cross-cultural studies present meth- 
odological problems of translation and 
communication. A general assessment 
of such methodological difficulties is 
given by Hudson (1959) and others. 
Restrictions imposed by customs; tabus; 
values; political considerations; charac- 
teristic patterns of interpersonal be- 
havior, including hierarchies of authority 
and prestige; and the semantic difficul- 
ties inherent in translations may all di- 
rectly or indirectly exert their influence. 

The care with which the translations 
were made is exemplified by de Lo- 
pategui (1957). Three independent 
translations by two psychologists and 
a high school principal were compared, 
differences discussed, and a joint agree- 
ment reached. The Spanish version was 
then translated back into English by 
another psychologist unfamiliar with 
the original English form. This demon- 
strated that except for a few minor dis- 
crepancies which did not affect the 


2The out-of-pocket costs for these studies 


were provided by the Purdue Research 
Foundation from royalties on sales of the 
Youth Inventory. The author is greatly in- 
debted to E. F. Hylla, H. Roth, and F. Siillwold 
for the data from West Germany, staff mem- 
bers of the Hochschule fiir Internationale 
Paedagogische Forschung, Frankfurt am Main 


meaning, the Spanish translation was 
satisfactory. 


RESULTS AND DISCUSSION 


While the SRA Youth Inventory was 
orginally primarily designed as an in- 
strument on which to base individual 
counseling, a purpose the various trans- 
lations will serve, we are here chiefly 
concerned with possible generalizations, 
with uniformities across different cul- 
tures. Tables 1 and 2 show that the 
problems and the cultural values of dif- 
ferent cultures to be inferred from them 
show substantial similarities, thus vali- 
dating empirically an assertion made 
nearly 10 years ago (Remmers, 1954): 
Man .. . is, with the exception of a few ob- 
vious and largely irrelevant differences in color 
and size, everywhere the same. Everywhere he 
faces the same persistent life problems: to gain 
subsistence, shelter, and security, and to per- 
petuate the species, to form some sort of group 
life in which the individual can function in the 
common affairs of life, to regulate human con- 
duct through the transformation of simple im- 
pulsive behavior into behavior patterns con- 
sonant with the strivings of the group. This 
process of socialization or acculturation is per- 
haps the most distinctively “human” character- 
istic of the species. Psychologically, the values, 
the mores, the loyalties, the ideals, or, in short, 
the attitude patterns thus formed may be 
thought of as the dynamics of social behavior 
(pp. 13-14). 

The means of Table 1 were computed 
by weighting the responses to each item 
3, 2, and 1 for the large, the middle- 
sized, the small boxes and O for the 
circle, respectively. This “intensity di- 
mension” is the chief difference between 
Form S and Form A. On Form A, in 
contrast, the respondent merely checks 
any item he perceives as a problem. In 
view of the fact that the numbers of 
items in the SRA Youth Inventory dif- 
fer slightly from one culture to another, 
these means are only very roughly com- 
parable. That notwithstanding this they 
show such substantial correlation of 
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TABLE 1 


Rank Orper oF MEAN Scores or THE Ercut ProptemM AreAS oF THE SRA Youtu INVENTORY 


American 
Girls 


Problem area 
Boys 


38.38 
50.42 
40.72 
49.07 
28.89 
28.56 
20.07 
33.87 


1527 


40.99 
51.37 
34.26 
44.85 
27.52 


26.01 


. My school 

. After high school 

. About myself 

. Getting along with others 
. My home and family 

. Boy meets girl 

. Health 16.66 
. Things in general 30.58 


N 1465 
Rho .98 


PnNOUS YS Ww 


rankings is all the more striking, rang- 
ing as they do from .43 between German 
girls and Indian teenagers to .98 between 
American youngsters and German girls. 
The median of these rho’s is .74. 

Table 3 gives the reliabilities of the 
eight parts of the inventory and their 
intercorrelations except for the Spanish 
version. Unfortunately, the punched 
cards for it were inadvertently destroyed 
before the intercorrelations could be 
computed, 

Obviously, the numerical values of 
the reliability coefficients for each part 
are highly similar. The psychological 
similarity of the relative self-perceived 
importance of these problem areas again 
is apparent. 


TABLE 2 


RANK-OrpER CORRELATION OF MEANS oF Boys 
AND Girts CoMBINED 


Indian 
(Ma- 
rathi) 


German German 


ie 
Spanish boys girls 


Nationality 


1. American 71 93 66 
2. Spanish 79 62 
3. German boys 83 
+. German girls 43 


Note.—Decimal points omitted 


Indian 
(Marathi) 
Girls 


West German 
Girls 


Spanish 


Boys 


61.80 
48.78 
61.54 
64.55 
50.45 
22.49 


44.01 
56.09 
34.80 
44.99 
33.84 
34.30 
18.31 25.63 
37.04 61.37 


283 3 81 


To probe the meaning of these cor- 
relation matrices further factor analyses 
were computed.* 

A principle-factor solution was used 
to obtain the characteristic roots and 
vectors. The reliabilities for each inven- 
tory were used as estimates of the com- 
munalities. 

In each analysis the first factor ac- 
counted for about 65% of the total vari- 
ance. All of the latent roots were of 
magnitude 4.9. The factor loadings for 
the first factor for all the inventories 
show striking similarity as seen in 
Table 4. 

The second factor accounted for about 
9% of the total variance. The magni- 
tude of all these roots was in the neigh- 
borhood of .7. Thus the second and 
third factors were chosen with reluc- 
tance. Here the similarities of the fac- 
tor loadings are obviously rather dis- 
similar as compared to Factor I, in view 
of the first factor’s heavy loading, ac- 
counting for approximately two-thirds 
of the total variance. 

The third factor chosen accounted for 


approximately 7% of the variance on 


8 These computations were made by the au 
thor’s assistant, W. H. Payne 
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TABLE 3 


INTERCORRELATIONS OF THE EIGHT SUBSCORES 


Problem area Subject 1 


American 
Spanish 
German 


My school Boys 


Girls 
Marathi 
American 
Spanish 
German 


After high school Boys 


Girls 
Marathi 


American 
Spanish 
German 


About myself Boys 


Girls 
Marathi 
American 
Spanish 
German 


Getting along with others Boys 


Girls 
Marathi 
American 
Spanish 
German 


My home and famil) Boys 


Girls 
Marathi 
American 
Spanish 
German 


Boy meets girl Boys 


Girls 
Marathi 


American 
Spanish 
German 


Health Boys 


Girls 
Marathi 
American 
Spanish 
German 


Things in general Boys 9 61 


Note.- 


Girls 3 49 § : 39 
Marathi 58 39 


(99) 
(98) 
(88) 


54 
47 
59 


Decimal points omitted. Numbers in parentheses are coefficients of internal consistency 
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TABLE 4 


UNROTATED FACTOR 


LoapINncs FOR THE Matrices SHOWN in TABLE 3 








United 
States 
Il 


India 
Ill II 
08 
46 
30 
20 


Al 
46 
23 
Ol 
51 
— 11 
02 
=~" 


48 
A3 
AS 
03 
- 07 
28 
29 
36 


05 
38 
50 


all of the inventories and, again, the 
similarities of factor loadings were no- 
ticed. These roots were of the magni- 
tude .5. Doubtless, error variance 


would account for at least some of the 
observed dissimilarities. 

In general, then, the factor analysis 
again supports the similarity of response 
patterns and their psychological mean- 
ing across the cultures at issue. 


The averages of Table 1 clearly show 
considerable variation across cultures. 
In some cultures youngsters perceive 
themselves as having more and more 
important problems than others. To 
show this in qualitative form Table 5 
was prepared. It shows a sample of 20 
items with the percentage of “large box” 
responses for American, German, and 
Indian boys and girls. In western cul- 
tures American youngsters are signifi- 
cantly more worried about learning to 
read better and about getting people to 
like them (Items 1 and 118) than 
youngsters in the other cultures. Ger- 
man teenagers are more concerned about 
preparing for tests (Item 37), what 
people do in college (Item 44), what 
career to pursue (Item 59), being more 
help to their families (Item 182), and 
the possible elimination of slums (Item 
278). 

All of these problems, however, and 


Ill II 


12 
-.14 
26 
— .23 ; — Ol 
05 
Al 
2 8 — 28 


14 


Germany 
_ (Girls) 
II 


Germany 
(Boys) 


cae 
es 
os 


lil 


43 
53 


03 


o> 
ae 


a> > 


25 


- 15 


22 


wn Oe 
ao 


g 


00 


many more are perceived as “most 
serious” by a much larger proportion 
of Indian youngsters. “The heavy and 
the weary weight of all this unintelligi- 
ble world” evidently rests most heavily 
upon them.* 

In each of the four studies, here re- 
ported only in part, many relationships 
between inventory scores were ana- 
lyzed—sex, grades, rural versus urban 
residence, region, socioeconomic status, 
public versus private schools, caste in 
India, parents’ education, and religion. 
Space does not permit inclusion of all 
the many tables of quantitative data. 
Tables 6 and 7 are abstracted from Kul- 
karni (1961) and Taliana (1958) to 
give some indication of these relation- 
ships in the two most disparate cul- 
tures.° 

From these tables it is clear that the 
self-perceived problems are functions of 
a variety of environmental variables. 
In Table 6, of the 80 F values, 69 are 


*We have speculated as to whether these 
results may be a function of response set among 
Indian students not present at least among 
American students (Weisbrodt, 1959). S. S. 
Kulkarni intends to test the hypothesis experi- 
mentally. 

5 The interested reader can of course obtain 
these dissertations either through interlibrary 
loan or from University Microfilms, 313 North 
First Street, Ann Arbor, Michigan. 
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TABLE 5 


COMPARATIVE Percentaces or Some “Most Serious Prosiems” 


: American German Indian 
— Boys Girls Boys Girls | Boys Girls 


1. I want to learn how to read better 55 53 


I wish I knew how to study bette: 79 
I need to learn how to prepare for tests : 28 5 70 


I wish my teachers would give me en 


couragement. 63 


I wish I understood science better. ( y 60 
I worry about getting good grades y 23 3 4 5 65 


I want to know more about what 
people do in college. 


I wish I could afford college 

What career should I puisue? 

I feel guilty about things I’ve don 

I want people to like me better 

I wish I could carry on a pleasant « 
versation. 

I want to make new friends 

I need to develop more self-confidence 
I need to learn to be a “good sport” in 
games. 


It bothers me that some kids are left 
out of things. 


I wish I had a quiet place at home 
where I could study. 


I wish I could be of more help to my 
family. 
Is there any way of eliminating slums? 


How can I learn to use my leisure time 

wisely ? 
x 16.75 18.0 
N 1465 1,527 
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TABLE 6 


Mean COMPARISON 


SIGNIFICANCE TEST VALUES FoR SINGLE 


CLASSIFICATION 


ANALYSBS OF VARIANCE 


Rural 


Gr 
urban srade 


Problem area Region 


38.5** 
26.0** 
64.6** 
28.3** 
84.8** 
86.7** 
102.9** 
is o** 


41.6** 
28.6** 
94 g** 
29.6** 22.9** 
165.0°* 24.6** 
85.0** 5 

81.4** 20.3** 


27.9** 20 6** 


16.0** 
26.4** 
15.0°* 


My school 

After high school 

About myself 

Getting along with others 
Home and family 

Boy meets girl 

Health 

Things in general 


Note.—After Kulkarni (1961, pp. 154-162) 


* Significant at the .05 level 
** Significant at the .01 level 


significant at or beyond the .01 level, 
4 at the .05 level, and only 7 are non- 
significant. In Table 7, of the 56 values, 
44 are significant at or beyond the .01 
level, 5 at the .05 level, and 7 are non- 
significant. 

The data presented in this paper sup- 
port the following conclusions: 

1. Teenagers’ self-perceived problems 
can be comparably measured 
widely different cultures. 


across 


2. The high internal consistency co- 


Religion 


$.0°* 
ted 


Moth- 
er’s 
educa- 
tion 


Fa- 
ther’s 
educa- 


Socio- 
eco- 
nomic 
status 


Caste 


|S pa 


-* 
* 


ow 


MAGN NN 


*** 
*** 


NN SDS OUw 


efficients indicate a reliable measuring 
instrument. 

3. There is a high degree of similarity 
of rankings of problem areas across cul- 
tures. 

4. Health problems for teenagers of 
different cultures are of minimal con- 
cern, 

5. Post-high-school problems tend to 
be of major concern 

6. While the relative rankings 
problem areas are highly correlated, the 
amount and intensity of worry vary 
greatly across cultures 


of 


TABLE 7 


Mean COMPARISON SIGNIFICANCE TEST 
ANALYSES OF 


Rural 
urban 
t value 


Problem area Region 


My school 

After high school 

About myself 

Getting along with others 
Home and family 1.2 
Boy meets girl ; 2.7 
Health 2 2.2 
Things in general 4 1.1 


. 


Note.—Except ¢ values as noted. After Taliana (1958 


* Significant at the .0S level 
** Significant at the .01 level 


VALUES FoR SINGLE CLASSIFICATION 


VARIANCE 


Socio- 
eco- 
nomi 
status 


Mother's 
educa- 
tion 


Grade Religion 


19.7** 
&.3** 
6.9** 


27.7** 
15.6** 
10.8** 
10.4** 

9.0** 

6.7** 
13.2** 
12.4** 


4. 3*7 
| 
, 3ee 


6 8.2** 


2.7%* 


8 
1.7%* 7 
1 
7? 


5.9** 
15.5** 
4.5* 


7 979% 
. 


‘ 


3 OFF 





Cross-CuttTurAL Stupies oF TEENAGE 


PROBLEMS 


REFERENCES 


pe Lopatecut, Micvera. Needs and problems 
of Puerto Rican high school students. Un- 
published doctoral dissertation, Purdue Uni- 
versity, 1957. 

FraNKLIN, R. D., & Remmers, H. H. Youth 
looks at civil liberties and the 1960 election. 
Purdue Opin. Panel Poll Rep., 1960, No. 61 

Hackett, C. G. Use of an opinion polling 


technique in a study of parent-child relation- 
Purdue U. Stud. higher Educ., 1951 


ships. 
No. 75. 

Hupson, B. B. (Ed.) Problems and methods of 
cross-cultural research. J. soc. Issues, 1959 
15, 5-20. 

Kurxarni, S. S. A Marathi revision of the 
SRA Youth Inventory. Unpublished doctoral 
dissertation, Purdue University, 1961. 

Remmers, H. H. Introduction to opinion and 
attitude measurement. New York: Harper, 
1954. 

Remmers, H. H. Early socialization of atti- 
tudes. In E. Burdick and A. J. Brodbeck 
(Eds.), American voting behavior. Glencoe 
Ill.: Free Press, 1959. Ch 


Remmers, H. H., Baker, P. E., Firzuvcn, 
Katuieen B., & Heats, R. W. Musical 
knowledge and some social attitudes of high 
school youth. Purdue Opin. Panel Poll Rep., 
1956, No. 44 

RemMers, H. H., Horton, R. E., & Matver, 
R. E. Youth and the 1952 election. Purdue 
Opin. Panel Poll Rep., 1952, No. 33 

Remmers, H. H., & Surmeerc, B. Manual 
SRA Youth Inventory, Form S. Chicago 
Science Research Associates, 1960 

SurmsBerc, B. The development of a needs and 
problems inventory for high school youth 
Unpublished doctoral dissertation, Purdue 
University, 1949. 

Tariana, L. E. Youth’s problems as they see 
them: A statistical analysis and restand 
ardization of the SRA Youth Inventory. Un- 
published doctoral dissertation, Purdue Uni 
versity, 1958. 

Welssropr, J. A. The effects of response set on 
the SRA Youth Inventory. Unpublished 
master’s thesis, Purdue University, 1959 


Received March 5, 1962) 





Journal of Educational Psychology 
1962, Vol. 53, No. 6, 262-270 


PARENTAL IDENTIFICATION IN RELATION 
TO VOCATIONAL INTEREST DEVELOPMENT’ 


JOHN O. CRITES 
University of Iowa 


The purpose of this study was to derive and test 3 hypotheses about the 
relationship of parental identification to vocational interest development: 
(a) degree of identification with the father and mother correlates with 
vocational interest pattern, (b) kind of identification with the father and 
mother varies with vocational interest pattern, and (c) pattern of identi- 
fication is associated with masculinity-femininity of interests. Tests of the 
hypotheses were made on 3 groups of Ss and Ns equals to 100, 100, and 
150. Results from the Semantic Differential and Strong Vocational 
Interest Blank indicated that a son’s identification with both of his 
parents significantly affects the patterning of his interests, but that his 
identification with the father is more important. 


Several theories of vocational inter- 
ests suggest that, in addition to intellec- 
tual and attitudinal factors, the indi- 
vidual’s identifications with significant 
others, particularly his parents, affect 
the likes and dislikes he develops. Car- 
ter (1940) proposes that identification 
serves to narrow the range of a person’s 
interests and to focus them in areas 


of activity and endeavor which lead to 
the attainment of status and the imple- 


mentation of the self-concept. Bordin 
(1943) hypothesizes that the degree of 
identification with the father directly 
varies with the relationship of the son’s 
interests to the paternal occupation, the 
agreement being greater when (a) the 
occupation has high status value, (5) 
the emotional bond between father and 
son is close, and (c) the occupation is 
traditional in the family. Super (1953), 
in his theory of vocational development, 
observes that “childhood and adolescent 
identifications play a part in shaping 
vocational interests” (p. 188). 
Research on these propositions about 
identification and interests comes from 
two different types of investigations: 
Presented in abbreviated form at the annual 


convention of the American Psychological As- 
sociation, New York, September 2, 1961. 


one on the relationship between the in- 
terests of fathers and sons, and another 
on the relationship of independent 
measures of identification to interests. 
In two studies of father-son interest 
similarity, Strong (1943, 1957) obtained 
an average r equal to .30, but with a 
range which extended from .11 to .48. 
Berdie (1943) cites unpublished data 
gathered by Forster which agree with 
Strong’s results, the median coefficient 
being .33, as well as studies by himself 
and Dvorak which showed that the sons 
of skilled workers, businessmen, and 
physicians have interests like their 
fathers’, Using @Q sorts to measure 
identification, Stewart (1959) found 
that boys who perceive themselves as 
their mothers perceive them, both actu- 
ally and ideally, have more masculine 
interests. With the same methodology 
but using college female subjects, White 
(1959) concluded that girls who de- 
scribe themselves as both their parents 
see them have more feminine interests. 
Finally, Henderson (1958) and Steimel 
(1960) have reported questionaire find- 
ings on high school males which indi- 
cate that some relationships exist be- 
tween identification and (a) father-son 
interest similarity and (6) masculinity- 
femininity of interests. 
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PROBLEM 


The results of these studies generally 
confirm theoretical expectations, but 
they leave several questions about the 
role of identification in interest develop- 
ment unanswered: Do individuals who 
identify more or less closely with their 
parents have different types of voca- 
tional interests? Do individuals who 
identify more with the father or the 
mother develop different kinds of voca- 
tional interests? And, do individuals 
who identify with the appropriate pa- 
rental role model have interests which 
are consistent with their sex? To answer 
these questions, it is necessary first to 
formulate an acceptable definition of 
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identification. Although the concept has 
several connotations (Freud, 1946; 
Mowrer, 1950; Sanford, 1955), it usu- 
ally refers to the individual’s develop- 
ment of reaction patterns, attitudes, 
ideals, and values similar to those he 
perceives in his parents’ behavior 
(Stokes, 1950). As Knight (1940) 
points out, the process by which a child 
achieves an identification encompasses, 
first, a projection of some of his char- 
acteristics on to the role model and, 
second, an introjection of these attri- 
butes, along with others idiosyncratic 
to the model, into his personality (as 
depicted in Figure 1). 

When defined as behaviors which the 
individual patterns after those he sees 
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in his parents, it becomes apparent that 
identification varies along two different 
dimensions, one quantitative and the 
other qualitative. The first, degree of 
identification, expresses the extent of 
perceived similarity between the indi- 
vidual and his role model: he may iden- 
tify more or less with his father or 
mother. The second, kind of identifica- 
tion, represents differences in the ob- 
ject with which the individual identifies: 
he may identify with his father, mother, 
or both. Together, degree and kind of 
identification define the pattern of iden- 
tification an individual establishes with 
his parents: it may be any one of the 
possible combinations of the two dimen- 
sions, such as (a) complete identifica- 
tion with the (4) cross-sex parent. 


HYPOTHESES 


From the ways in which parental 


identification varies in degree, kind, and 
pattern and from the individual differ- 


ences which exist in vocational inter- 
ests, both in type and masculinity-femi- 
ninity, it is possible to derive a set of 
three testable hypotheses about identi- 
fication in relation to interest pattern- 
ing. These hypotheses, and the ra- 
tionales for them, are as follows. 

Degree of identification with the 
father and mother correlates with voca- 
tional interest pattern. The more an in- 
dividual tries to emulate the behavior of 
his parents, the more his interests should 
resemble theirs. For example, if a son 
identifies with a father who has the 
likes and dislikes of an engineer, then 
the son should have interests which are 
patterned in the same occupation or 
field (Bordin, 1943). 

Kind of identification with the father 
and mother varies with vocational in- 
terest pattern. According to Strong 
(1955), some interest areas are more 
masculine than others, e.g., the tech- 


nical versus verbal-linguistic occupa- 
tions. Therefore, an individual who 
identifies more with one parent than 
with the other, or who identifies with 
both parents, should have interests 
which are consistent with his role 
models. For example, a mixed-sex iden- 
tification should produce an interest pat- 
tern in occupations, such as those in the 
social service field, which allow an op- 
portunity for the expression of both mas- 
culine (e.g., doing research) and femi- 
nine (e.g., helping people) preferences. 

Pattern of parental identification is 
associated with masculinity-femininity 
of interests. An individual who identi- 
fies strongly with either his father or 
mother should have interests which are 
appropriately sex-typed. For example, 
a son who closely follows his mother’s 
example should have interests which 
are more feminine than those of a son 
who develops a strong identification with 
his father. 


PROCEDURE 
Measurement of Variables 


Parental identification. This variable has 
been defined as the perceived similarity of the 
individual to his parents, and the best measure 
of it appears to be the Semantic Differential 
(SD) (Bieri, Lobeck, & Galinsky, 1959; Lazo- 
wick, 1955; Osgood, Suci, & Tannenbaum, 
1957). For the measurement of parental iden- 
tification, the SD consists of nine bipolar, 
seven-point scales on which the subject rates 
his concepts of “self,” “father,” and “mother” 
with respect to opposed adjectives. The nine 
scales were selected and validated by Bieri et al. 
(1959) and are as follows: (1) potency 
(strong-weak, heavy-light, and rugged-deli- 
cate); (2) activity (fast-slow, hot-cold, and 
active-passive); and (3) evaluation (clean- 
dirty, fresh-stale, and happy-sad). On the rat- 
ing forms, the scales are arranged in random 
order and the sequence of concepts is varied 
from one page to another, so that systematic 
response sets are disrupted. Three identification 
scores are derived from the sum of the squared 
differences between the ratings for a pair of 
concepts (Osgood & Suci, 1952): (a) F = de- 
gree of father identification; (5) M — degree 
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of mother identification; and (c) FM =the 
difference between the F and M scores, or kind 
of parental identification. Patterns of identifi- 
cation (high F—high M, high F--low M, etc.) 
are obtained from combinations of F and M 
scores which fall in the high, average, and ‘ow 
ranges of these two variables, as ¢efin« 
+1 standard deviation about the mean.” 
Vocational Interest Patterning. The Strong 
Vocational Interest Blank (SVIB) was { to 
measure interests (Strong, 1943). tterns of 
interests were analyzed by a new .,stem pro- 
posed by Stephenson (1961), which enumer- 
ates all possible combinations of scores on the 
SVIB profile sheet for primaries, secondaries, 
and rejects in all 11 interest groups. A primary 
pattern on the SVIB, for example, consists of a 
majority of scores in the A and B+ score 
ranges, where these scores indicate a high degree 
of similarity between a subject’s interests and 
those of men successfully engaged in different 
occupations. For a measure of masculinity- 
femininity of interests, scores on the MF scale 
of the SVIB were utilized. 


Subjects 


The SD and SVIB were administered to 350 
male subjects subdivided into three groups: 
The Original group consisted of 100 vocational- 


2 In addition to identification, it is possible to 
measure both projection and introjection with 
the SD. The former would be the difference 
between a subject’s perceived similarity with 
either parent and the parent’s self-perception 
(D—parent’s self), and the latter wouid be the 
difference between a subject’s perceived simi- 
larity with either parent and his projection on 
to the parent [D—(D—parent’s self)]. In 
other words, D is composed of two compon- 
ents, projected self and introjected parent; and 
these can be defined by the arithmetical opera- 
tions given above. 


educational clients seen at the counseling service 
of the University of Iowa during the 6-month 
period from January to June 1960. The Repli- 
cation group numbered an additional 100 voca- 
tional-educational clients counseled during the 
next 6-month period from July to December 
1960. And, the Generalization group included 
150 nonclients tested in a large undergraduate 
psychology course during the winter semester of 
1960-61. The characteristics of these samples, 
with respect to age and intelligence, correspond 
closely to each other, to the general client and 
nonclient populations at the University of Iowa 
(local norms), and to client samples described 
in other studies (e.g., Merrill & Heathers, 1953). 
The means, standard deviations, and ranges of 
the three groups for age (to last birthday) and 
intelligence (composite percentile rank on the 
Iowa Scholarship and Placement Tests) are 
given in Table 1. None of the differences be- 
tween the group means on these variables was 
significant at the .01 level 


Method of Analysis 


The statistical analyses for each of the sam- 
ples were the same, the intention being to deter- 
mine how stable and valid the results were 
through replication and generalization. Simple 
analysis of variance was used to compare the 
subjects with primary and secondary patterns 
in different interest groups on F, M, and FM 
(Lindquist, 1953). A two-way analysis of 
variance was employed to test for trends in the 
means of the three samples on the dependent 
variables which were significant in the initial 
tests (Lindquist, 1947). Other statistical tech- 
niques, such as ¢ tests and product-moment 
correlations, were applied as necessary in fur- 
ther analyses of the data. The Ns for some of 
the analyses were less than the total samples, 
because a few subjects had unpatterned inter- 
ests 


TABLE 1 


Means, STANDARD DEVIATIONS, AND RANGES OF ORIGINAL, REPLICATION, AND GENERALIZATION 
SAMPLES ON AGE AND INTELLIGENCE 


21.14 
20.78 
20.34 


Original clients 100 
Replication clients 100 
Generalization nonclients 150 


Intelligence 


M SD Range 
4-97 
5-96 
1-99 


52.08 28.09 
$2.26 2031 
5 26.42 


Note.—None of the differences between the samples in age and intelligence was significant at the .01 level 
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TABLE 2 


INTERCORRELATIONS, MEANS, AND STANDARD DEVIATIONS OF IDENTIFICATION VARIABLES FOR THE 
OricinaL, REPLICATION, AND GENERALIZATION GROUPS 
(Ns = 100, 100, and 150, respectively) 


Variable 


Sample 


Original 
Replication 
Generalization 


Original 
Replication 
Generalization 
Original 

FM Replication 
Generalization 


Father 


Mother 


Note.—All r’s were significant at or beyond the .01 level. 
* Higher scores indicate less identification with the parents. 
> The r’s for M versus FM are negative because scores for the latter variable are obtained by subtracting M 


from F. 


RESULTS 


Statistical Assumptions 


The assumptions underlying the 'sta- 
tistical procedures appeared to be 
tenable. Edwards’ (1954) test for homo- 
geneity of variances indicated someidif- 
ferences between the score dispersions 
of the interest groups for the dependent 
variables, but these were largely negligi- 
ble and nonsignificant. The overall) dis- 
tributions for F and M_ identification 
measures were negatively skewed ‘and 
were consequently converted to T scales 
(McCall, 1939), with the effect of: ap- 
proximating normal distributions within 
interest groups. The masculinity-femi- 
ninity scores were normally distributed 
without transformation. 


Identification Variables 


Table 2 presents the data on the in- 
terrelationships and central tendencies 
of the identification measures for the 
three samples. The correlations for any 
two of the variables, e.g., F versus M 
or F versus FM, across the three groups 
were not significantly different at the 
01 level, as assessed by ¢ tests of 
extreme values. Similarly, the means 


Actually, the correlations between M and FM should be read as positive. 


were homogeneous, with one notable ex- 
ception. As tested by simple analysis of 
variance, the mean father identification 
scores of the Original, Replication, and 
Generalization groups were significantly 
different at the .001 level (F=9.78; 
df=2/347; poor=6.91). Separate ¢ 
tests showed that the subjects in the 
client samples identified considerably 
less with their fathers than the subjects 
in the nonclient sample. 


Hypothesis 1 


Tables 3 and 4 present the findings 
on the hypothesis that Degree of iden- 
tification with the father and mother 


TABLE 3 


ANALYSIS OF VARIANCE FOR VOCATIONAI 
INTEREST PATTERN AND DEGREE OF 
IDENTIFICATION WITH FATHER 

Sample MS 


Between 
Within 
Total 
Between 
Within 
Total 
Between 
Within 
Total 


Original 
clients 


Replication 
clients 


Generalization 
nonclients 


* .05 level of significance. 
O1 level of significance 
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TABLE 4 
ANALYSIS OF VARIANCE FOR VOCATIONAI 
INTEREST PATTERN AND DEGREE OF 


IDENTIFICATION WITH MOTHER 


MS 


Sample 


Between 29.67 
Within 54.70 
Total 84.37 
Between 68.22 
Within $8.12 
Total 126.34 
Between 163.13 
Within 68.78 
Total 231.91 


yriginal 
clients 


Replication 
clients 


Generalization 
nonclients 


* .OS level of significance. 


correlates with vocational interest pat- 
tern. They indicate acceptance of the 
hypothesis only for father identification. 
Evidently, the father plays a more 
prominent role in the formation of in- 
terest patterns than the mother, at least 
for males. As depicted in Figure 2, sons 
who identify closely with their fathers 
tend to have predominant interests in 
the business detail occupations, whereas 
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TABLE 5s 


ANALYSIS OF VARIANCE FoR VOCATIONAL 
INTEREST PATTERN AND KIND OF 
PARENTAL IDENTIFACTION 


MS 


» 


Sample 


321.44 
116.00 
437.44 
343.25 
147.41 
490.66 
240.22 

91.43 
331.65 


Between 
Within 
Total 
Between 
Within 
Total 
Between 
Within 
Total 


Original 
client 


Replication 
clients 


Generalization 
nonclients 


Ceeaenru OeKwo 


To 


* OS level of significance 
** O1 level of significance. 


those who identify only slightly with 
their fathers tend to develop primary 
interests in the verbal-linguistic field. 
Most sons identify partially with their 
fathers, however, and form interest pat- 
terns in a variety of occupational groups, 
including such diverse ones as physical 
science and social service. These find- 
ings are equally valid for the Original, 
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Fig. 2. 


Father identification means for SVIB primary and secondary interest 


patterns. 
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Replication, and Generalization samples, 
since trend tests revealed no significant 
differences between them on mean father 
identification scores for the several in- 
terest areas (F=0.92; df=18/256; 
Pos =1.75). 


Hypothesis 2 


Table 5 summarizes the results on the 
hypothesis that Kind of identification 
with the father and mother varies with 
vocational interest pattern. The data 
provide some support for it in each of 
the samples. As shown in Figure 3, like- 
sex identifications characterize males 
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who have primary interests in business 
detail occupations, whereas mixed-sex 
and cross-sex identifications typify those 
who have patterned interests in other 
fields, such as the technical and verbal- 
linguistic. These tendencies emerge, 
even though the Degree of identification 
with the father or mother may be low. 
In other other words, identifying at least 
minimally with either one or both par- 
ents correlates with the development of 
an interest pattern. This conclusion ap- 
plies to all three samples, since trend 
tests yielded no significant differences 
between them on FM (F=0.85; 
df=18/252; pos=1.75). 
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Hypothesis 3 


Finally, the data on the hypothesis 
that Pattern of parental identification 
is associated with masculinity-femi- 
ninity of interests support a signifi- 
cant covariation only in the Generaliza- 
tion group. In this sample, possibly 
due to its larger V, which makes the sta- 
tistical test more powerful, there are sys- 
tematic differences in the mean MF 
scores for males who identify more or 
less closely with the appropriate parent 
as a role model (F=3.24; df=3/16; 
p.os = 3.24). More specifically, close like- 
sex identifications produce masculine 
interests, close mixed-sex identifications 
result in a combination of masculine and 
feminine interest, close cross-sex iden- 
tifications yield feminine interests, and 
negligible identifications with both sexes 
lead to feminine interests. Stated some- 
what differently, the intensity of an in- 
dividual’s identification with either or 
both of his parents directly influences 
the masculinity-femininity of his in- 
terests. 


DISCUSSION 


The results on the relationships be- 
tween the various aspects of parental 
identification and vocational interest 
patterning tend to support some of the 
hypotheses but not others. As predicted 
(Bordin, 1943), F varies with the kind 
of vocational interest pattern which an 
individual develops: sons who perceive 
themselves as quite similar to their 
fathers have interests in different areas 
than those who see themselves as unlike 
their fathers. In contrast, M appears 
unrelated to interest patterning: evi- 
dently mothers have less impact on the 
likes and dislikes acquired by their sons 
than theory suggests (Stewart, 1959). 
FM, which expresses the individual's 
relative acceptance of the father or 
mother as a role model as well as degree 
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of perceived similarity with both parents 
(FM correlates significantly with F and 
M in each sample), relates to interest 
patterns as hypothesized: sons who iden- 
tify more with one parent than the other 
have primaries and secondaries in dif- 
ferent groups on the SVIB. Finally, 
pattern of identification may affect the 
masculinity-femininity of an individual’s 
interests, but the evidence on this rela- 
tionship is not conclusive, since a sys- 
tematic trend was found in only one 
of the samples. Considering the tests 
of all three hypotheses, the most general 
conclusion which can be drawn from the 
results is that for males identification 
with both parents influences the forma- 
tion of vocational interest patterns, but 
identification with the father is more 
important than with the mother. 

The means and intercorrelations of 
the identification variables, although in- 
cidental to the main hypotheses of the 
study, provide data which reveal an im- 
portant difference between clients and 
nonclients. The differences between the 
mean father identification scores of the 
client and nonclient groups indicate that 
males with vocational-educational prob- 
lems have not achieved adequate inden- 
tifications with their fathers, a conclu- 
sion also supported by no group differ- 
ences in mean mother identification 
scores. The questions raised by these 
findings, which seem reliable and well- 
substantiated, directly pertain to our 
traditional conceptualization and con- 
duct of vocational counseling at both 
the secondary and higher educational 
levels. For example, how reasonable is it 
to expect a client to develop an appro- 
priate masculine identification as a basis 
for the resolution of his vocational prob- 
lems within the 2 or 3 interview hours 
of typical vocational counseling? How 
effective are the usual techniques of 
test interpretation and presentation of 
occupational information in the facilita- 
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tion of the identification process? And, 
what are the implications of the male 
client’s underidentification with a father 
figure for the sex of his counselor? 
These and related questions need seri- 
ous consideration, if we are to adapt 
vocational counseling to what is known 
about the nature of vocational problems. 

Finally, the interrelationships of the 
identification measures are quite similar 
to those obtained in previous studies 
(Bieri et al., 1959; Payne & Mussen, 
1956; Sopchak, 1952). There are low 
positive correlations between degrees of 
father and mother identification and 
moderate positive correlations of each 
with kind of identification. In other 


words, the young adult male identifies 
with general familial patterns of atti- 
tudes and behavior as well as the dis- 
tinctive attributes and characteristics of 
each parent, much as Freud (1927) ob- 
served in his early writings on ego and 
super-ego development (pp. 40-44). 
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RESPONSES OF BRIGHT, NORMAL, AND EMH 
CHILDREN TO AN ORALLY-ADMINISTERED 
CHILDREN’S MANIFEST ANXIETY SCALE* 


NEIL A. CARRIER, KENNETH D. ORTON 
Southern Illinois University 
anp LESLIE F. MALPASS 
University of South Florida 


Groups of children 10-14 years old differentiated (by WISC IQ) into 
bright (mean IQ 129), normal (mean IQ 102), noninstitutionalized edu- 
cable mentally handicapped (EMH) groups (mean IQ 67), and an insti- 
tutionalized EMH group (mean IQ 67) were given the Children’s Mani- 


fest Anxiety scale (CMA scale) orally and individually. 


CMA scale 


scores generally differentiated the groups, and girls tended to have higher 
A scale and L scale scores than boys. Tentative evidence of a negative 
A scale IQ relationship as well as a possible acquiescence set in low 
IQ Ss suggest caution in using the CMA scale. 


The number of recent studies employ 
ing the Children’s Manifest Anxiety 
(CMA) scale indicates the popularity 
of this revision for children (Castaneda, 
McCandless, & Palermo, 1956) of the 
Taylor Manifest Anxiety (MA) scale 
(Taylor, 1953). However, the studies 
reported do not always show consistent 
findings. While normative and relational 
data on this instrument have been re- 
ported by a number of investigators 
(Castaneda, McCandless, & Palermo, 
1956; McCandless & Castaneda, 1956: 
McCandless, Castaneda, & Palermo, 
1956; Malpass, Mark, & Palermo, 
1960; Palermo, 1959), differences be- 
tween studies in the age and intelligence 
range of subjects tested, the method of 
administering the scale, and the kinds 
of other measures employed have prob- 
ably contributed to the discrepancies be- 
tween results. 

A previous study (Malpass et al., 
1960) compared the scores of groups of 
institutionalized and public school edu- 


1This study is part of research project 
No. 578 (SAE 8182) sponsored by the United 
States Office of Education Cooperative Re- 
search Program and by the Graduate School, 
Southern Illinois University. 
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cable mentally handicapped (EMH) 
children and normal children on the 
CMA scale. But, whereas most studies 
have administered the CMA scale as a 
pencil-and-paper group test, Malpass 
et al. (1960) met the problem of read- 
ing deficiency in the retarded subjects 
by using an oral-individual administra- 
tion procedure in testing them, but used 
the usual procedure in testing normal 
subjects. 

There are three major features of the 
present study: (a) the use of subjects 
selected and classified by the Wechsler 
Intelligence Scale for Children (WISC) 
into bright, normal, and EMH groups 
(the last category subdivided into insti- 
tutionalized and _ noninstitutionalized 
subjects), thus enabling a determination 
of whether the CMA scale differenti- 
ates these groups; (6) the use of the 
oral-individual method of administer- 
ing the CMA scale with all subjects, 
thereby avoiding the interpretation 
problem imposed by two different ad- 
ministration procedures that the Mal- 
pass et al. (1960) study encountered, 
and also providing additional normative 
data on this method of administration: 
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(c) the use of a slightly older group of 
children than has generally been used. 


METHOD 


Subjects 

Children ranging in age from 11 to 14 years 
from five southern Illinois communities were 
classified by WISC IQ into bright, normal, 
and noninstitutionalized educable mentally 
handicapped (NIEMH) subjects. The 32 
bright subjects (IQ interval 120-150) had a 
mean age of 12-7 years and a mean IQ of 
129. The 31 normal subjects (IQ interval 
90-110) had a mean age of 12-7 years and a 
mean IQ of 102. The 27 NIEMH subjects (IQ 
interval 50-80) were selected from five public 
school special classes. Their mean age was 
12-11 years and mean IQ was 67. A group of 
17 institutionalized educable mentally handi- 
capped (IEMH) subjects were tested at the 
Lincoln State School and Colony in Illinois. 
The interval of this group was also 50-80; the 
mean age was 12-3 years and the mean IQ 
was 67.2 


Procedures 


An oral-individual procedure for administer- 
ing the CMA scale was employed with all sub- 
jects. The experimenter carefully read each of 
the 53 statements to each subject recording 
each “yes” or “no” response as it was given.® 
A previous check had found that EMH sub- 
jects attend to this procedure and apparently 
comprehend the items (Malpass et al., 1960). 

Employing simple analysis of variance, each 
of the two scales—anxiety (A scale), and lie 
(LZ scale)—of the CMA scale was used as a 
criterion measure with overall group differ- 
ences determined separately by sex. Signifi- 
cance of differences between adjacent classifi- 
cation groups were then computed using ¢ 
ratios for independent samples. Zero-order 
correlations between A scale scores and both 
IQ and CA were obtained for each group and 
sex within each group. 


2 Grateful acknowledgment for assistance in 
the data collection phase of this study is made 
to school officials in Carbondale, Du Quoin, 
Herrin, Marion, Murphysboro, and West 
Frankfort, Illinois, and to J. Albaum, Super- 
intendent, and W. Chambers, Chief Psychol- 
ogist, Lincoln State School and Colony. 

8 Three of the IEMH subjects gave “yes” 
responses to every statement in the scale; one 
gave only one “no” response. Their data were 
discarded. 


RESULTS AND DISCUSSION 


The results as presented in Table 1 
show that overall group differences were 
significant at the .001 level for both 
sexes on both scales. In Table 2 it can 
be seen that the A scale significantly 
differentiated the classification groups 
excepting on the normal-NIEMH com- 
parison. 

There was some confirmation of sex 
differences in A scale scores as has been 
found in several studies (e.g., Castaneda, 
McCandless, & Palermo, 1956). How- 
ever, the only such sex difference sig- 
nificant at the .05 level was found for 
NIEMH subjects. While this is in ap- 
parent agreement with a similar finding 
in the Malpass, Mark, and Palermo 
(1960) study, problems in obtaining an 
appropriate NIEMH sample from a re- 
stricted population resulted in a dis- 
crepancy of 14 IQ points between sexes 
(means: boys, 72; girls, 58) which could 
account for this result. 

Compared to the study most similar 
to the present one in terms of kind of 
subjects used (Malpass et al., 1960), 
the means for the classification groups 
on the A scale show an irregular tend- 
ency to be somewhat higher. In the 
earlier study “normal” subjects (actu- 
ally including both normal and bright 
subjects as defined in the present study) 
scored a mean of 13.7. Combining our 
bright and normal groups produces a fig- 


TABLE 1 


ANALYSIS OF VARIANCE OF CLASSIFICATION 
Groups 


___A scale 
Source dj MS 


Boys 
Groups 3 
Error 64 
Total 67 

Girls 
Groups 3 
Error 35 
Total 38 


664.31 
$3.64 


469.79 
40.91 


Significant at the .001 level 





CHILDREN’S MANIFEST ANXIETY 


TABLE 2 


DirreRENces In A SCALE AND L ScALE ScorES BETWEEN SELECTED CLASSIFICATION GROUPS 








Group 


Bright, all 
Normal, all 
NIEMH, all 
IEMH, all 
Bright boys 
Normal boys 
NIEMH boys 
IEMH boys 
Bright girls 
Normal girls 
NIEMH girls 9 22.56 


IEMH girls 8 30.75 


Note.—+ ratios are for adjacent groups 


ure for comparison of 15.8. The IEMH 
subjects in the present study had a 
mean A scale score of 30.6, while those 
in the previous study had a mean A 
scale score of 25.9. However, the 
NIEMH subjects in the present study 
scored lower than those in the earlier 
one, 19.5-21.4. These differences may 
be due to chance variation, to the fact 
that our subjects averaged about a year 
older than subjects previously used, 
and/or to the influence of the oral-indi- 
vidual administration procedure. That 
the latter may produce some (probably 
small) elevation in A scores over those 
obtained by the usual procedure is not 
unlikely. This should be tested in a 
study specifically designed for this pur- 
pose. 

Significant differences between the 
classification groups on the JZ scale 


L Scale 
SD t ratio 
2.01 
70 
2.30 
3.28°** 


6.15*** 344%* 


(Table 2) were found between the nor- 
mal and NIEMH groups both for boys 
and for sexes combined, and between 
the EMH groups both for girls and 
for sexes combined. Not shown is the 
significant sex difference for IEMH 
subjects. In three of four comparisons 
a tendency is shown for the girls to 
have higher scores than boys. This is 
consistent with previous findings (Mal- 
pass et al., 1960).* 

Within-group correlations between 
A scale scores and IQ were computed. 
The correlation for bright subjects was 
—.01, for normal subjects, —.35 (sig- 
nificant .05 level), for all EMH sub- 
jects combined, —.31 (significant .05 
level), for NIEMH subjects, —.32, and 
for IEMH subjects, —.40. The all- 


* B. R. McCandless, personal communication, 
1961. 
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EMH-boys correlation of —.41 was also 
significant at the .05 level. Although 
based on samples of restricted variance, 
it is interesting to note that with but 
one exception (bright girls) all of the 
15 A scale—IQ correlations by sex and 
within-group were negative. 

When all classification groups were 
combined the correlation between the 
A scale and IQ was —.54 (.01 level); 
with the bright, normal, and NIEMH 
groups combined it was —.39 (.01 
level); with the bright and normal 
groups combined it was —.45 (.01 
level). It should be noted, however, 
that by thus combining groups more sub- 
jects are included in the tails of the 
distribution than would likely be found 
in a random sample of the population. 
The correlations presented are probably 
thereby inflated. 

The general impression from the cor- 
relations in the previous two paragraphs, 
however, is that control on IQ may be 
desirable in studies relating A scale 


scores to other variables. For example, 


several investigators have used the 
CMA scale as a measure in studies re- 
lating anxiety (or “drive’) and com- 
plex learning (Castaneda, Palermo, & 
McCandless, 1956; Palermo, Castaneda, 
& McCandless, 1956). If A scale scores 
are indeed negatively related to IQ, it 
is possible that the poorer performance 
of high anxiety subjects observed on the 
more complex tasks could be partially 
explained as a function of intelligence. 

Of interest is the finding that IEMH 
subjects as a group (and by sex) gave 
more responses representing anxiety than 
did the NIEMH subjects. This finding 
is in general agreement with the Mal- 
pass, Mark, and Palermo (1960) study. 
But experience with the testing of these 
groups suggests care in interpreting this 
finding. The investigators gained a 
definite impression of an “acquiescence 
set,” in questioning EMH subjects, 
which was particularly noticeable with 


the institutionalized subjects. Although 
neither the observation of such a tend 
ency nor speculation about its likely 
instrumental qualities in the life of the 
retarded person are original with the 
writers, a caution is thereby raised 
against naively assuming differences in 
“anxiety” or “drive level” between the 
groups solely on the basis of A: scale 
differences. 

In summary, it appears that the 
CMA scale administered orally and 
individually generally differentiated the 
classification groups of this study. A 
tendency was found for girls to have 
higher A scale scores and L scale scores 
than boys. But the negative correla- 
tions of IQ with A scale scores, and the 
suspicision of an acquiescence set in 
the EMH subject caution in inferring 
the bases upon which the observed dif- 
ferentiation was effected. 
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CHANGES IN STUDENT 


TEACHERS’ ATTITUDES 


TOWARD CHILDHOOD BEHAVIOR PROBLEMS 


GEORGE A. PINCKNEY 
University of Nebraska 


To determine the extent to which a course influences attitudes, 203 stu- 
dents in introductory educational psychology ranked 32 childhood be- 
havior traits on the basis of seriousness. Rankings were made during the 
lst and last week of classes, and changes in rank order were analyzed 
and compared with those of a control group of peers and with the rank- 
ings of clinicians in a study done a decade earlier. Precourse rankings of 
the control and experimental groups were not reliably correlated with 
the 1951 clinicians. On postcourse rankings the correlation between the 
control group and clinicians remained insignificant while the correlation 
between the clinicians and the experimental group was highly reliable 
suggesting that courses can change attitudes. 





Educators have long been aware of 
the difficulty teachers have in judging 
pupil behavior in its true psychological 
perspective. Teachers have a tendency 
to overrate the seriousness of behavior 
problems related to sex, dishonesty, and 
disobedience, while underrating shyness, 
sensitiveness, fearfulness, and other be- 
havioral manifestations which do not ex- 


press pupil antagonism toward authority 
in the classroom. 


Wickman (1928) found that teacher 
ratings of the seriousness of 50 behavior 
problems correlated negatively with rat- 
ings given by 30 clinicians. Rank-order 
correlation between the ratings of 28 
teachers in Cleveland and the clinicians 
was found to be —.22, while a group 
of 511 Minneapolis teachers correlated 
—.11 with the same 30 clinicians. 

This, and similar studies (Amos & 
Washington, 1960; Gage & Suci, 1951; 
Gronlund, 1950; Hunter, 1957; Thomp- 
son, 1940), have drawn attention to the 
need for more understanding and edu- 
cation on the part of student teachers 
concerning the behavior problems of 
young children. Teacher training insti- 
tutions have expended considerable 
quantities of both time and money in- 
vestigating the most suitable coursework 


and supervised experience programs to 
include in their respective curricula. 

Twenty years after Wickman’s study, 
Schrupp and Gjerde (1953) replicated 
the Wickman rating procedure to de- 
termine what, if any, progress had been 
made in teachers’ ability to interpret 
psychological problems manifested in 
child behavior. They found that differ- 
ences were still present in teacher-clini- 
cian ratings, but that the magnitude of 
these differences was not as great as it 
had been in the earlier study. 

This closer agreement between the 
classroom teacher and the clinician can 
be assumed to have been brought about, 
in part at least, by the general increase 
in public knowledge of basic psycho- 
logical concepts, as well as by the adop- 
tion of curriculum policies which re- 
quire one or more courses in psychology 
in teacher education programs. 

The present study was designed to 
investigate the extent to which changes 
in rating of childhood behavior problems 
are related to the experience in psychol- 
ogy courses for teachers college students 


METHOD 
Subjects 


The experimental group for this study con- 
sisted of 203 undergraduate students enrolled 
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in six sections of an introductory educational 
psychology course in Teachers College, Uni- 
versity of Nebraska. The control group was 
composed of 58 students enrolled in the same 
college, but who had not taken coursework in 
any area of psychology. All of the subjects in 
both the experimental and the control group 
were planning for careers in the teaching pro- 
fession. 


Procedure 


At the first-meeting of each section of the 
educational psychology courses the subjects 
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were presented with a list of 32 behavior traits 
selected from Wickman (1928). Instructions 
were given to rank these traits from the most 
serious to the least serious using a nine-point 
Q sort technique after Stephenson (1953). A 
list of the behavior traits under consideration 
in the present study may be seen in Table 1. 
Subsequent course material was presented in 
the usual manner for the remainder of the 
semester, with no further mention of the rank- 
ing being made. All reference to the Wickman 
(1928) study was carefully removed from lec- 
and assignments were 


ture notes, reading 


TABLE 1 


RANKING OF 32 BEHAVIOR TRAITS 


Experi- 
mental 
postranking 


Experi- 
mental 
preranking 


Traits 


(6.48) 
(5. 
(5. 
(5. 


(5 


Stealing 
Cheating 
Destructiveness 
Untruthfulness 
Cruelty 
Heterosexual 
activity 
Disobedience 
Obscene notes 
Temper tantrums 
Profanity 
Masturbation 
Selfishness 
Impudence 
Unhappiness 
Overcritical of 
others 
Resentfulness 
Unsocialness 
Smoking 
Domineering 
Suspiciousness 
Easily discouraged 
Laziness 
Tattling 
Enuresis 
Nervousness 
Cowardliness 
Fearfulness 
Interrupting 
Tardiness 
Sensitiveness 
Shyness 
Dreaminess 


52) 
29) 
28) 
24) 


(5.14) 
(6.45) 
(5.73) 


(7.45) 


11) 
97) 
96) 
(4.94) 
(4.94) 
(4.62) 
(4.56) 
(4.33) 
(4.15 


5(5.50) 
(5.08) 
(4.70) 
(6.06) 
(4.43) 
(5.00) 
(5.09) 
(4.75) 
(6.80) 


(5. 
(4. 
(4, 


(4. 
(4. 
(4. 
(3. 
(3 

(3. 
(3.! 
(3. 
(3. 
(3. 
(3. 


(2 


10) 
07) 
03) 
76) 
76) 


(5.05) 
(5.92) 
(6.26) 
(2.85) 
(4.92) 
(5.17) 
5(5.50) 
(3.79) 
(3.34) 
(4.21) 
(5.30) 
(4.24) 
(5.34) 
(2.61) 
(2.68) 
(4.91) 
(3.81) 
(3 19) 


13) 
09) 
98) 
(2.93) 
(2.84) 
(2.71) 
(2.62) 
(1.85) 
(1.53) 29 


preranking postranking 


31 
32 


1951 
clini- 
cians 


1928 
clini- 
cians 


1928 
teach- 
ers 


Control Control 


11 
20 
14 
19 


(5.94) 11.5 
(5.87) 
(5.53) 
(5.59) 


(6.84) 


(6.67) 
(6.06) 
(5.35) 
(5.43) 
(5.41) 
(5.2 
(4.84) 
5(4.34) 
(3.84) 
(4.65) 
(3.93) 
(4.03) 
(4.59) 
(3.30) 


(3.87) 5) 
(5.38) 
(4.61) 
(5.12) 
(5.29) 
(4.64) 
(4.83) 
(4.38) 
(4.68) 


(4.47) 
(4.50) 
(3.72) 
(2.62) 
2.5(4.34) 
(3.66) 
(3.62) 
(3.22) 
(3.47) 
5(2.90) 
(2.87) 
(3.61) 
(2.97) 
22 (3.37) 
26.5(2.90) 
31.5(2.03) 
30 (2.34) 
31.5(2.03) 


(3.19) 
(4.22) 
(3.64) 
(2.90) 
(3.51) 
(3.55) 
(3.68) 
(3.90) 
(3.29) 
(3.16) 
(3.03) 
(3.42) 
(2.01) 
(2.97) 
(2.80) 
(2.42) 
(1.61) 
(1.32) 


onr & & 


= Ge 


— ee hh bo 
Aw Dame uM ~T & 
na wn 


w 
_ 


Note.—Mean scores on the Q sort are shown in parentheses 
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screened to avoid, as much as possible, any 
reference to it or similar work. 

The subjects in the control group were 
chosen at random from sections of courses in 
history and principles of education, and were 
administered the Q sort with the same instruc- 
tions as the experimental group. 

The same procedures were followed with 
both the experimental and control groups dur- 
ing the final week of the school year. The Q 
sorts were scored on a nine-point basis, with 
the trait ranked as least serious receiving no 
points. A rank order of trait seriousness based 
upon mean Q sort scores was established for 
precourse ratings by both the experimental and 
the control groups, and a correlational analysis 
of these was made. 


RESULTS 


The rankings of the 32 behavior 
traits, together with mean rating for the 
experimental and. control groups, are 
shown in Table 1. The results of the 
correlational analysis of the precourse 
and postcourse rankings for both the 
experimental and the control groups are 
shown in Table 2. A ranking of the 32 
behavior traits based on the earlier work 
of Wickman (1928) and Gage and Suci 
(1951) was made for the 1928 teachers, 
1928 clinicians, and the 1951 clinicians. 
These results are also included in 
Table 2. 

The precourse ranking of the control 
group was found to correlate .939 with 


the experimental groups’ precourse 
ranking of the behavior traits. The pre- 
course ranking of the control group cor- 
related .854 with their own ranking of 
the traits at the end of the course. The 
postcourse ranking of the experimental 
group, however, correlated only .555 
with the control groups’ postcourse rank- 
ing and .608 with their own ranking 
made prior to the course. 

The control group correlated —.152 
with the 1951 clinicians on the pre- 
course rankings and —.029 with the 
postcourse ranking. The experimental 
group, on the other hand, correlated 
— .069 with the 1951 clinicians on the 
precourse ranking and .663 on the post- 
course ranking. 

The significance of the change of the 
experimental group from a correlation 
of —.069 with the 1951 clinicians in 
the precourse ranking to a correlation 
of .663 on the postcourse ranking was 
determined by transforming the rho’s 
to Z values and expressing the differ- 
ences as a relative deviate of the nor- 
mal curve as suggested by Wert, Neidt, 
and Ahmann (1954). This change was 
found to be significant at the .001 level 
of confidence. The same change for the 
control group was found to be nonsig- 
nificant. 


TABLE 2 


INTERCORRELATIONS AMONG Group Ratincs ON 36 Benavior TRAITS 


1951 
clinician 


1928 


Group Nate 
clinician 


220 
860** 


1928 teachers 

1928 clinicians 

1951 clinicians 

1961 control (precourse) 

1961 control (postcourse) 
1961 experimental (precourse) 


* Significant at .05 level 
Significant at .01 leve 


1961 
experi- 
mental 
(pre- 
course ) 


1961 
control 

(pre- 
course) 


1961 
control 
(post- 
course) 


3816** 
107 

— 069 
.939** 
901** 


764** 6S0** 

063 017 

152 029 
g54** 
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DISCUSSION 


It is apparent from Table 2 that the 
attitudes of the experimental group 
toward childhood behavior problems 
changed to a significant degree. A cor- 
relation of —.069 was found between 
this group and the 1951 clinicians on 
the precourse rankings. The rankings 
at the end of the course correlated .663, 
indicating a change was significant at 
the .001 level of confidence. No such 
change was apparent in reference to the 
control group. The precourse and post- 
course correlations of the controls with 
the 1951 clinicians were —.152 and 
—.029, respectively. 

The data would tend to substantiate 
the hypothesis that psychology courses 
may play a significant role in the for- 
mulation of teachers’ attitudes toward 
childhood behavior problems. An in- 


tegral part of this educative process 
must certainly be the attitudes which 
the instructor himself feels are impor- 
tant. These may be expressed formally 


in class lectures or may be interpreted 
by students from the less overt be- 
havior patterns of the instructor. Their 
relative contribution to the total change 
in attitudes is not known, but it must 
be assumed that they exert a larger 
influence on the experimental group than 
on the controls. This must be kept in 
mind when interpreting the data of the 
experimental group. 

It may be noted that of the five traits 
rated as most serious by the 1951 
clinicians (unsocialness, unhappiness, 
cruelty, fearfulness, and suspiciousness) 
four were among the traits registering 
the greatest change from precourse to 
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postcourse rankings of the experimental 
group. The greatest disagreement be- 
tween the clinicians and the experimen- 
tal group of the postcourse ranking was 
found on the traits of shyness, dreami- 
ness, and heterosexual activity. 

In conclusion, it seems apparent from 
the results of this investigation that in- 
troductory courses in psychology can 
meet the needs of student teachers for 
better understanding of the basic prin- 
ciples of behavior. 
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DISCERNMENT OF TEACHER CHARACTERISTICS 
BY TAT SEQUENCE ANALYSIS 


M. INNOCENTIA BURKARD 
Mount Mary College 


What are some of the underlying personality factors that distinguish 
teachers rated by their pupils at the extremes in teaching efficiency? 
300 religious women teaching in Grades 4 through 12 were rated by 
their classes (10,720 pupils). TAT Sequence Analysis of 1200 stories 
of high-rated and low-rated paired samples yielded clearly distinguishing 
characteristics in the 2 groups (p < .001). The high-rated teachers 
showed a basic disposition to respond to life realistically and construc- 
tively, while the low-rated ones did not. Results were verified by the 
successful assignment of the teachers to the high or low group on the 
basis of their positive or negative scores on the empirically established 


criteria. 


While many investigators have found 
differences in the observable behavior 
of teachers who were rated at the ex- 
tremes in teaching efficiency, not many 
have studied the underlying aspects of 
the teacher’s personality that might ac- 
count for the differences in overt be- 
havior (Barr & Jones, 1958). The pres- 
ent investigation was based on the 
hypothesis that such fundamental dif- 
ferences between teachers who are rated 
high by their pupils and those who are 
rated low can be reached by a disguised 
technique, specifically, a projective test. 
For this purpose the Thematic Apper- 
ception Test (TAT) was selected as 
the most promising for its penetration 
of the deeper motivational levels of the 
personality; and Arnold’s TAT Se- 
quence Analysis (Arnold, 1949, 1960) 
was judged the most objective method 
of analyzing and scoring the results. 


METHOD 


Criterion Groups 


The criterion groups were set up on the 
basis of pupil ratings according to the Di- 
agnostic Teacher-Rating Scale by Amatora 
(1950). The Scale consists of an Area scale 
calling for a rating on a five-point continuum 
in each of seven divisions: liking for teacher 
(I); teacher’s ability to explain (II); kind- 
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ness, friendliness, and understanding (III) ; 
fairness in grading (IV); discipline (V); 
amount of work required (VI) ; liking for les- 
sons (VII). Following this is a Diagnostic 
Check List consisting of 49 scaled statements. 
The latter are arranged in 7 intrascales of 
7 items for each of the above divisions. These 
items are in turn scaled so that their values 
extend from highest to lowest, with the fourth 
item in each intrascale at the midpoint. 

The original sample of this investigation 
consisted of 300 teachers, all of whom were 
religious women teaching in parochial and 
private elementary and secondary schools in 
the Midwest. Fourteen high schools and 23 
grade schools were represented. The teachers 
ranged in age from 20 to 68 years, and from 
1 to 48 years in teaching experience. The 
schools varied in size, the number of faculty 
members ranging from 1 to 40. In every case 
but two, the entire faculty was included in 
the rating. 

The raters were a total of 10,720 pupils, 
6,226 in Grades 4-8, and 4,494 in Grades 9-12 
They included both boys and girls, and, with 
the exception of chance absentees, the entire 
class of each teacher rated. 

The rating scale was administered by the 
investigator to each class in its own home- 
room. Pupils were not identified, and teachers 
only by code number. As a means of prepar- 
ing the pupils, the second part of the scale, 
the Diagnostic Check List of 49 items, was 
administered first. The pupils were asked to 
check each item that was true of this teacher. 
On the Area scale, which followed immedi- 
ately, the pupils were asked not to rate their 
teacher as called for on the scale but to rank 
her relative to all the other teachers they had 
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ever had. The numbers thus indicated rank 
positions and could be considered arbitrary 
weights or scores for purposes of calculation; 
for example, each teacher who was ranked at 
the top of a particular pupil’s teachers received 
a score of 5 for the area on which she was 
so ranked. 

Each teacher’s mean score on each of the 
seven areas was computed, using the desig- 
nated figure in the five-point scale as a score. 
Inspection revealed that these means became 
progressively lower from the fourth grade to 
the twelfth, indicating that the pupils became 
more critical and discriminating as they grew 
older. To have a common basis for compari- 
son, therefore, centile norms were set up in 
each of the seven areas for each of the fol- 
lowing grade levels: 4, 5-6, 7-8, 9-10, 11-12 
(Burkard, 1958; Innocentia, 1962). 

Out of the original 300 teachers a sample was 
made consisting of 25 pairs of elementary 
teachers and 25 pairs of secondary teachers, 
matched on age and intelligence. In the pair- 
ing process, such factors as size and location 
of school, grade level, and subject matter 
taught were also controlled as far as _ pos- 
sible. The high member of each pair had 
received rank scores in the upper centiles on 
five out of the seven areas of the rating scale, 
while the low member was at the opposite 
extreme in the same five areas. For pairing 
purposes, Divisions IV and VI, which corre- 
lated least with the rest of the scale and 
which were difficult to interpret because of 
varying regulations with regard to grading and 
assignments, were ignored. The median rank 
score of the elementary teachers on the re- 
maining five areas was 86.7 for the high group 
and 16.3 for the low; in the secondary sec- 
tions it was 85.5 for the high group and 23.9 
for the low. Table 1 presents the compara- 
tive statistics of the criterion groups on all 
the variables. 


Measures 


The TAT was administered to the 300 teach- 
ers in groups ranging from 2 to 23, each teacher 
writing 12 stories. Eleven pictures and the 
blank card were used in the following order: 
1, 2, 3BM, 4, 5, 6BM, 7BM, 8BM, 9BM, 10, 
16, 11. The standard directions were given, 
with emphasis on writing a story that con- 
tained a plot and an outcome. There was no 
time limit; the cards were numbered and dis- 
played about the room; and the stories were 
written in an unbroken period that varied 
according to individual needs, from 55 to 95 
minutes. In order to produce a comparable 


TABLE 1 


COMPARATIVE STATISTICS OF THE PAIRED 
Groups 


Secondary 
teachers 


High Low 
rank 


Elementary 
teachers 


High | Low 
rank | rank | rank 


Variables 


N 25 25 25 25 
Age Mean 
Experience * 
IQ» 


42.6 3 

Mean 20.8 9 
Mean 113.9 | 116 
Median 3 
1 


Mean 


00 me me Gey 
OOwau so 


85.5 
10.4 


N~™4 Owns 


Rank score® | 
TAT score ¢ 


* Years of teaching experience in private or parochial 
schools. 

>Gamma IQ on Otis Intelligence Test. 

© Centiles based on 5 rank scores. 

4 Number of positive statements as measured by the 
Scoring hon 


measure for the control of intelligence, each 
teacher, at another session, did Form Am, 
Gamma, of the Otis Quick-Scorimg Mental 
Ability Test. 


TAT Analysis 


The method of TAT analysis used in this 
study is less generally known than others and, 
therefore, requires some elaboration. Arnold’s 
Sequence Analysis has developed from the 
observation that a series of stories gives a 
sequential account of the writer’s life situation. 
This is observable when the series is considered 
from the viewpoint of the “import” of the 
stories taken in sequence as written. By the 
import (Arnold, 1960) is meant a brief con- 
densation of the moral of a story without 
interpretation (p. 342). It is a restatement 
in a generalized, abstract form of what the 
writer is saying by means of a particular, 
concrete situation. This may be illustrated by 
the following story taken from a TAT pro- 
tocol. 


1. Johnny pleaded with his father to take 
lessons on the violin, but when the teacher 
prescribed an hour’s practice daily, he found 
it hard to give up his play time. A little 
sulkily he sat making a decision. With a 
sigh for the fun his buddies were having, he 
picked up his violin and began his scales. 
Father was reading the newspaper in his 
armchair. 


In the above story the writer is “actually” 
saying: 
You may want something; but then, when 
you find out it entails work and sacrifice, 
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you may not want to stick to it; and you 
do so only because you can’t get out of it 


It is characteristic of these generalized state- 
ments always to follow a sequence. If the 
abstraction is correctly done, there will ap- 
pear an association among these statements 
from story to story; and this feature makes of 
the total protocol a more or less continuous, 
connected expression of the subject’s way of 


When used in a clinical situation the Se- 
quence Analysis reveals the client's area of pre- 
occupation or conflict. But the analysis also 
reveals how the client is handling his con- 
flict, since each statement expresses an atti 
tude that may be designated as either construc- 
tive or not constructive. In the above series 
for instance, Import 4 may be considered more 
constructive than any of the others 


looking at his world. A series of stories taken 
from the same protocol and succeeding the ex- 
ample above, illustrate this sequence (see 
Table 2). 


In the present research it was this latter 
aspect of the analysis, that is, 
the expressed attitudes, that was studied. No 
prior judgment was made regarding the “con- 


the nature of 


TAT Protrocot witm SEQUENCE ANALYSIS 


TABLE 2 


Story as written 


2. In early spring Laura’s father always had to work doubly 
hard, and her mother lost some of her beauty with each passing 
year. But Laura just couldn’t let all their troubles come be- 
tween her and her books. She just wouldn’t end up a farmer’s 
daughter, she thought as she turned away. 


3. Six yards of sheer organdy from Aunt Bess! This was 
what had assured Jean that she might accept John’s invitation 
to the prom. Coming home from school happily dreaming of 
pretty ruffles and soft music, Jean had found little Janice with 
mother’s scissors, and her dreams destroyed. She knelt at the 
window seat and wept bitterly. She knew her answer must 


be, “No ” 


Mary pleaded as she caught Joe’s shoulder 
in a wild effort to restrain him. Then she glanced around their 
home, for it was home in spite of the cheap prints and gaudy 
furnishings. Joe signed up though, because he too loved and he 
was a man. 


4. “Don't go,” 


5. “So she put them there again,” Mrs. Carol noted with 
some surprise. Every day the girl cut flowers and placed them 
on Mr. Jack’s study table. “I ought to stop these going-on,” 
she muttered. But Mr. Jack paid cash and on time. This is 
what stopped her. 


6. His mother stood with her back to him as he poured out 
his story. Why didn’t she turn and tell him he was a no-good 
son, gambling away his life and hers, wasting his wages, leaving 
her nothing but her widow’s pension to eke out her existence. 
But she wouldn’t. She turned and put her hand on his shoulder 
with a smile. 


7. A dream come true! The old father smiled a contented 
man’s smile as his boy poured out his determination to give 
up that alluring woman and to finish his engineering course. 
One more year and he would have a stalwart partner 


Import 
2. (But) you won't let any- 
thing, not even the troubles 
of your loved ones, come be- 
tween you and what you want 
most 


3. (And) when someone de- 
stroys your dream, and you 
know you cannot have it, you 
weep bitterly 


4. (But) if you are strong, 
you can give up what you 
have for a greater love. 


5. You may know what you 
ought to do, but you don’t do 
it because you fear to lose 
by it 


6. You know you are no 
good and admit it, but still 
they do not reproach you 


7. They keep on hoping in 
you as you tell of your deter- 
mination to give up what 
lures you away and to per- 
severe 
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Table 2—Continued 





Story as written 


8. It was by pure accident that the rifle shot had struck 
old Mike in the stomach. Albert had grown into a man as they 
stretched Mike out for the emergency operation. Only after 
they assured him that Mike would be all right did he turn 
away. There was a dream in his eyes and a determination. He, 
too, would save lives with his intelligence and steady hand. 





9. He raised his head and studied the sleeping faces around 
him. There was Tom sprawled out with his head on Joe’s 
stomach. Sam lay snoring. Everyone was relaxed; but he was 
tense, excited with his first taste of battle. In just one hour 
it would begin again. 

10. Now they had been married forty years and Jim was 
just going off to work as usual. Mary smiled as she said, “God 
bless your day.” Jim picked up his lunch pail and set out 
with the same old song in his heart. 


11. The boy stood with his hand on the railing, looking up 
at the door with the little rivers of rain running down its 
shiny varnish. With his left hand be reached in to feel the 
warm report card stiff against his heart. Father was home 
and there would be no rewards this time. 


12. The wounded lamb had to be left on the narrow path 
just before the men had ascended the rude bridge. A vulture 
circled over it menacingly. As the last man disappeared in the 
valley, the bird darted down and plunged its beak into the 
soft flesh. 


structiveness” of the statements, the object 
being to find out what kind of attitudes were 


setting up a 


‘ 


Import 


8. It may have been a tragic 
accident that brought you to 
that determination. 





9. (And) the first taste of 
the struggle may have excited 
you more than others. 


10. (But) after many years 
you are still content to do the 
same things in the usual way 


11. A day of reckoning will 
come and it will bring no 
reward. 


12. (And) when you are 
left behind wounded and un- 
protected, someone will be 
ready to strike at you 


‘scoring key” empirically 


expressed predominantly by each criterion 
group. If the imports of the groups differed 
to a degree that they successfully discriminated 
between the high and the low teachers when 
the latter were not identified as such, these 
imports would in effect express the differences 
in the two groups on the level of motivation 
that the TAT explored. 


ANALYSIS AND RESULTS 
Construction of the Scoring Standard 


The TATs of the 100 cases in the 
paired sample were analyzed according 
to Arnold’s method of TAT Sequence 
Analysis described above. The results 
were in the form of 12 sequential state- 
ments (1 for each of 12 stories) for 
each case. The protocols of [0 pairs of 
teachers were selected at random from 
the secondary group for the purpose of 


on the basis of the statements of these 
20 teachers. Such a scoring key, if it 
successfully discriminated between the 
high and low teachers when they were 
not identified as such, would in effect 
express the differences in the two groups 
on the level of motivation that the TAT 
explored. For this purpose the TATs 
of the 10 selected pairs were scrutinized 
for types of statements that appeared 
predominantly among the high teach- 
ers and not among the low, and vice 
versa. Statements that appeared only 
occasionally in either group and ex- 
pressed a basic disposition contrary to 
the predominant one were transferred 
to the opposite set. Two opposing 
classes of statements were thus built up, 
one of which represented the disposi- 
tions expressed by the high group, and 
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the other, those expressed by the low 
group. The former were designated 
plus ; the latter, minus. 

Inspection revealed that both plus 
and minus statements readily fell into 
five categories having to do with the 
following aspects of life: success, fail- 
ure, loss (including disappointments, 
injustice, and hardships), the obliga- 
tions of life, and other people. Each of 
the assembled statements was assigned 
to the category to which it had refer- 
ence. 

A trial test was made to determine 
whether these sets of statements could 
be used with the elementary teachers to 
distinguish the groups according to their 
rank. Ten pairs, identified only by code 
number, were selected at random from 
among the elementary teachers. The 
cases were shuffled, and the investigator 
tried to predict blindly, on the basis of 
the categorized statements alone, which 
teachers had been rated high and which 
low. This attempt demonstrated that 
the compiled statements were not suffi- 
ciently inclusive for a workable stand- 
ard. These elementary cases were, there- 
fore, studied for differentiating charac- 
teristics in the same way the first had 
been. The results were found to be of 
the same general nature as those of 
the secondary group; and they were 
combined with the latter, making the 
“key” more descriptive and, hence, 
easier to use with other protocols. 
Table 3 presents illustrative statements 
from the Scoring Standard, somewhat 
generalized, though following closely the 
phraseology of the Sequence Analysis. 


Application of the Standard 


The remaining 60 cases were then 
evaluated according to the Scoring 
Standard. The cases were not identified 
as to pairs nor rating. Each statement 
in each subject’s sequence analysis was 


marked either plus or minus according 
to the standard; and, on the basis of 
the total positive statements, each case 
was assigned to the high or the low 
group. Two errors were made in this 
attempt: one high-rated case was inac- 
curately judged low, and one low case, 
high. 

To test the results further the 100 
TAT protocols and the Scoring Standard 
were given to two other judges, doctoral 
students in psychology who had been 
trained in sequence analysis. The judges 
scored the protocols independently, and 
both accurately placed every teacher in 
the high or low group on the basis of 
her plus and minus scores. On the 1,200 
stories taken as a whole, there was 
97.2% and 96.6% agreement between 
the investigator and Judges A and B, 
respectively. Agreement between Judges 
A and B was 94.3%. 

Final scores were then assigned to all 
cases. The scores were based on the 
decisions of two out of the three 
judges—that is, if two judges agreed 
that a statement had plus value, the 
statement was given a positive score. 
Allowing 1 score point to each state- 
ment, the positive scores of the high 
teachers ranged from 8 to 12, with a 
mean of 10.2 in the elementary group 
and 10.4 in the secondary. The range 
of positive scores in the low groups was 
from 0 to 4, the mean being 1.9 for the 
elementary and 1.8 for the secondary 
(Table 1). 

For a measure of statistical signifi- 
cance of the differences between the high 
and the low teachers, the sign test 
(Siegel, 1956) was applied to the pairs 
of scores. Since all the differences were 
in the same direction, that is, in every 
pair of teachers the high-rated one re- 
ceived more plus scores, x (the number 
of fewer signs) was 0. For N=25, an x 
of this size is significant beyond the 
.001 level of confidence. 
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TABLE 3 


ILLUSTRATIVE STATEMENTS FROM THE SCORING STANDARD 


Plus 


Minus 


Categories A and B: Success and Failure 


Achievement depends on constructive action: 
effort, work, sacrifice, preparation; realistic 
adaptation; supernatural or ethical motives; 
help or advice from others. Resolutions to 
achieve require recognition of difficulties in- 
volved. Failure results from inadequacy in 
oneself; it can be overcome by constructive 
action. 


Success comes in spite of uncertainty, com- 

promise. Success is only a dream. The goal 

| is achieved easily, unrealistically, without con- 

structive action. Resolutions are made im- 

pulsively, without foresight or planning. Fail- 

| ure just happens, or is due to others. One is 

dejected or seeks to escape in dreams or in 

| easy work. Failure is not admitted, or is over- 

come without constructive effort. Failure ends 
in despair or frustration. 


Category C: Loss, Harm, Danger 


Loss may be overcome by constructive action. 
Negative emotions are overcome. Loss is ac- 
cepted when required by the nature of things 
or for the good of oneself or others or by one’s 
duty. There is hope in spite of loss. 


Something is actively done about a loss. 


| Loss is due to others and causes negative 
emotions. Loss is not accepted; it always 
remains a tragedy. Loss is accepted passively 
without effort to overcome it. Loss is averted 
by others or by fate. Loss leads to impulsive 
| or desperate action; it ends in despair or 
| destruction 


Category D: Life and Its Obligations 


Legitimate enjoyment succeeds; _ill-inten- 
tioned action is punished. Revenge is rejected, 
frustrated, punished. Duties are undertaken 
for rational reasons; they are the right thing 
to do; they help others. Right conduct is re- 
warded; love is a help in doing right. 


Category E: People 


Others are good, responsive, helpful. Good 
relationships are attained by cooperative ac- 
tion: good-will, accepting one’s limitations 
realistically, avoiding or escaping bad com- 
panions. 


To determine which areas of the rat- 
ing scale were imost significant in iden- 
tifying the teachers with plus charac- 
teristics, phi coefficients were computed 
between the teachers’ scores on the TAT 
and their rank scores on each of the 
seven areas. The two categories of TAT 


Duties are undertaken and decisions made 
because there is no way out, or just to please. 
Right conduct brings failure, loss; love makes 
life easier. The view of life is unrealistic or 
illogical: life is all work, worry, emptiness; 
life is all sunshine. Work is drudgery. Wrong- 
doing is condoned, or is regretted for any rea- 
son other than conviction of its wrongness. 





Others are frustrating, unforgiving, unhelp- 
ful. One gives in to others even though they 
are not right. One depends on others entirely. 
Good relationships are attained out of fear 
or not at all. Bad relationships lead to no 
real problems. 


scores were the number of teachers mak- 
ing scores of 0-4 and the number mak- 
ing scores of 8-12. (There were no 
scores between 4 and 8.) The rank 
scores were dichotomized at the fiftieth 
centile. Table 4 shows these correla- 
tions. The coefficients ranged from .37 
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TABLE 4 


RELATIONSHIPS (Pu1 COEFFICIENTS *) BETWEEN 
TAT Scores anp SCORES ON THE SEVEN 
AREAS OF THE RATING SCALE 


Areas of rating scale 
TAT score 


Elementary 9 
(NV = 50) 
Secondary 

(N = 50) 


& 
bse 


96 


Note.—All coefficients significant at .01 leve 

* Phi coefficients are restricted in size and are, there- 
fore, lower than Pearson r’s. The phi coefficients in this 
study are further reduced by the use of Yates’ correc- 
tion for continuity. These coefficients may, accordingly, 
be looked upon as conservative estimates of the corre- 
lation between the given variables 


to .96. Lowest relationships were with 
Areas IV (grading) and VI (assign- 
ments) on both levels; highest, with 
Areas I (liking for the teacher) and 
VII (liking for the lesson) in the grade 
school, and with Area I in the high 
school. 


DISCUSSION 


The study supports the hypothesis 
that underlying personality differences 
between teachers ranked high by their 
pupils and those ranked low can be 
found by TAT Sequence Analysis. 
While the teachers were ranked on the 
basis of their external classroom be- 
havior, the TAT revealed their motiva- 
tional pattern in the form of convictions 
or attitudes. These convictions or atti- 
tudes are remarkably different for the 
two groups. 

The complete Scoring Standard, de- 
rived as it is directly from the TAT 
protocols, delineates these differences in 
detail. With regard to achievement, for 
instance, there is a strong contrast be- 
tween what may be designated the ac- 
tive attitude of the high group and the 
passive attitude of the low. The teachers 
in the high group constantly reiterate 
the need of action on their part to 
achieve success or to overcome failure. 


They foresee difficulties and recognize 
the need of planning and preparation. 
For them, success follows on such ac- 
tivity. The low group, on the other 
hand, do not mention these requirements 
of success. According to their stories, 
success just comes easily or in some 
equally unrealistic way; or, again, it 
does not come at all in spite of construc- 
tive activity on the achiever’s part. This 
group generally attributes failure to ex- 
traneous causes, rather than to inade- 
quacies in themselves (Table 3, Cate- 
gories A and B). 

The same active and passive attitudes 
are apparent toward loss and hardship. 
Here the activity of the high group is 
directed toward overcoming the loss, or, 
if that is impossible, toward bearing it 
without self-pity, resentment, or other 
negative emotion. The low-ranked teach- 
ers appear overcome by losses, and 
either express futile hostility or grief, or 
remain inert (Category C). 

With regard to life and its obliga- 
tions, the high group reveals an objec- 
tive, principled view. Right is accepted 
just because it is right; and what is 
seen as good conduct is rewarded, while 
wrongdoing fails or is punished. The 
reverse is expressed in the imports of 
the low . up. To these teachers life 
appears all drudgery or all joy with no 
adequate reason for either. Again, their 
passive attitudes are apparent in an in- 
ability to do anything toward bringing 
about that joy or eliminating the drudg- 
ery (Category D). 

Toward other people the high-ranked 
group expresses a willingness to go along 
reasonably, seeing them as good and 
helpful. The low group, on the con- 
trary, sees people either as hostile or as 
friendly in a sentimental way. Again, 
adequate reasons for the hostility or 
friendliness are not apparent. Good re- 
lationships are seen as unattainable, or 
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if attained, as leading to no good re- 
sults (Category E). 

The fact that these attitudes were 
given in a projective test suggests that 
they are habitual ways of thinking and 
feeling of which the owner is for the 
most part only dimly aware. Because 
of their habitual nature, the convictions 
are undoubtedly strongly motivational, 
inducing actions that are consonant with 
them. Further evidence of this is the 
high correlation between the TAT scores 
and rank scores, suggesting that the 
attitudes expressed in the TAT re- 
sult in actions that account for the cor- 
responding ranking. 

If this is true, the teacher ranked 
high would appear to be an active per- 
son, a hard worker, who recognizes her 
own limitations and also the demands 
of achievement; a person who takes an 
objective, realistic view of things and 
whose judgment is not clouded by nega- 
tive emotions. Her classroom behavior 


is probably similarly objective, well or- 


ganized, and efficient. 

The low-ranked person appears to be 
the reverse of all these. She expects 
success without working for it; and, 
when faced with failure or difficulty, 
acts in a less reasonable way. Appar- 
ently not seeing the relationship between 
the events of life and her own initia- 
tion of, or reaction to them, she tries 
to explain life in terms of accident or 
the ill will of others. Carried over to 
the classroom such attitudes probably 
result in poorly planned lessons, in- 
efficient procedures, and emotional re- 
actions to the consequent failures. 

Several unanswered questions present 
themselves. Do the expressed attitudes 
denote several characteristics or could 
they all be reduced to, let us say, ra- 
tionality versus emotionality? Is the 
unrealistic way of seeing things just one 
more effect of an inability to direct 
emotion reasonably? Or is there a cog- 


nitive factor operating here in addition 
to emotional control? This area needs 
to be further explored. 

The scoring system used in this study 
was very simple, merely giving one 
point for each plus statement, and not 
allowing for differences in the degree 
to which the positive or negative attitude 
is expressed. A 4-point scoring system, 
ranging from +2 to —2, has been de- 
vised since this study was completed, 
providing more flexibility in _ scoring. 
Further work needs to be done on this. 

It may be of interest to point out 
that the criterion in this study was 
not teaching efficiency as such, but 
rather the pupils’ rating of teaching effi- 
ciency. Whether these are the same 
thing or not has never been demon- 
strated empirically. Investigators of 
teaching efficiency have labored under 
the difficulty of finding a valid, relia- 
ble criterion. The objective here was to 
locate differences in the teachers as the 
latter. were evaluated by their pupils. 
The results seem to indicate that the 
religious woman who is ranked high by 
her pupils is a high-caliber person as 
far as the dimensions measured by TAT 
Sequence Analysis are concerned. 

Again, are the pupils merely react- 
ing to a “well-liked” person when they 
rank a teacher high? From Table 4 it is 
apparent -that the highest correlations 
with TAT scores are to be found in 
Area I, “liking for the teacher.” Area IT, 
having to do with how clearly the teach- 
ing is done, also correlates high with the 
scores; but how much of this is halo 
effect? On the other hand, the pupils’ 
rating is based, not on samples of the 
teacher’s classroom behavior, but on 
all of it. This fact, coupled with a con- 
sideration of the calibre of the high- 
ranked and the low-ranked teacher, sug- 
gests that the pupils’ rating might be as 
true an estimate of the teacher’s effi- 
ciency and personality as the ratings of 
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part-time observers or the indications of 
other criteria. 

In spite of these unanswered ques- 
tions the study reveals the divergent 
ways of thinking of the two criterion 
groups. The fact that the subjects were 
all religious women limits the degree to 
which the scoring criteria can be applied 
to teachers in general. It is reasonable 
to propose, however, that the technique 
of TAT analysis used here, since it suc- 
ceeded so well in distinguishing high 
and low groups, could well be used to 
set up scoring criteria for other teachers 
and eventually to predict teaching suc- 
cess. 
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THE RELIABILITY OF MEASURES OF 
HANDWRITING QUALITY 


LEONARD S. FELDT 
State University of lowa 


This paper reports a reliability analysis of measures of handwriting 
quality. Components of variance techniques are employed to assess the 
magnitude of measurement errors arising from various sources. The data 
indicate that the reliability of quality measures, as typically obtained 
in elementary school classrooms, is quite low. In Grades 1 and 2 the 
coefficients for measures based on a single day’s writing sample range 
from .4 to .6. Systematic variation among teachers in the harshness 
of their judgments is shown to be a particularly important source of 
error. Suggestions are offered for improving the reliability of handwriting 
evaluations. 


Scales for evaluating the quality of 
pupil handwriting were among the first 
standardized measuring instruments to 
be used in American schools. Thorn- 


dike’s Scale for Handwriting of Chil- 
dren was published in 1910, several 
years before the first intelligence tests 
found their way into the classroom. 
Ayers’ (1912) A Scale for Measuring 
the Quality of Handwriting of School 


Children, which is still widely used, and 
Freeman’s (1915) An Analytical Scale 
for Judging Handwriting appeared soon 
after the Thorndike scale in the period 
1912-15. 

Although handwriting scales have 
been in use more than 50 years, rela- 
tively little research has been reported 
regarding their reliability. Only one 
of the commercially distributed scales 
was backed at publication by reliability 
data, and virtually no studies have sub- 
sequently appeared which deal specifi- 
cally with this problem. The major pur- 
pose of this investigation was to gather 
such data for one of the more recently 
published handwriting scales. 


EXPERIMENTAL DESIGN 


The present study was concerned 
with the reliability of the Evaluation 
Scales, Grades 1 and 2, which accom- 


pany Guiding Growth in Handwriting 
(Freeman, 1959). These two scales are 
used to evaluate manuscript writing 
and include specimens representative of 
five levels of quality. Instructions for 
using the scale indicate that pupil writ- 
ing samples may be assigned scores 
higher than 90, the value for the best 
of the scale specimens, or lower than 
65, the value for the poorest of the scale 
specimens. These directions were fol- 
lowed in carrying out the study. 


Subjects 


The subjects who took part included 30 first 
grade pupils and 29 second grade pupils enrolled 
in the University Elementary School of the State 
University of Iowa. This school uses the series 
Guiding Growth in Handwriting (Freeman, 
1959). The pupil population of the University 
Elementary School is considerably above aver- 
age in intelligence and achievement. Many of 
the children are sons and daughters of univer- 
sity faculty members and probably receive 
more parental encouragement in their school 
work than do typical public school children. 
However, their achievement in handwriting is 
not above average. The data reported below 
do not reflect the superiority which generally 
characterizes their average achievement in 
other school subjects 


Procedure 


Each child contributed three samples of writ- 
ing, following the procedures specified in the 
teacher’s edition of Guiding Growth in Hand- 
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writing. The first sample was obtained on 
December 8, the second on December 13, and 
the third on December 16, 1960. The separa- 
tion’ of testing dates was considered adequate 
to permit study of day-to-day fluctuations in 
quality. It was sufficiently short, on the other 
hand, to rule out any real improvement in 
overall writing quality. 

The three judges who evaluated the writing 
samples were drawn from the enrollees in an 
Iowa University course in kindergarten and 
primary teaching methods. Two of the judges 
were graduate students and one was a senior 
undergraduate. All of the judges had had some 
teaching experience in the lower elementary 
grades. Preliminary investigation suggested 
that the judges needed supervised practice in 
the use of the Evaluation scales. Three practice 
sessions were held, in which samples of hand- 
writing were projected on a screen via an 
opaque projector. The judges viewed the sam- 
ples, made a tentative evaluation of quality, 
compared ratings, and then discussed their dif- 
ferences. The purpose of these sessions was to 
sharpen perceptions of overall quality and to 
develop consistent standards regarding the em- 
phasis to be given the individual aspects of 
slant, spacing, clarity of letter form and letter 
size. 

In the experiment proper, the three judges 
evaluated the test samples independently. The 
papers were identified only by pupil number, 
and submitted to each judge in a different ran- 
dom order. Each judge recorded his ratings on 
a separate score sheet seen only by himself 


The statistical method used to analyze 
the data was that known as the com- 
ponents of variance technique. This 
methodology yields not only reliability 
coefficients but also independent esti- 
mates of the error variance arising from 
various sources. Five categories of 
measurement error were hypothesized 
for ratings of handwriting quality: 
(a) momentary variations in the quality 
of pupil writing, as reflected within the 
sample taken in a single testing session; 
(6) undependability or inconsistency 
arising from the judging process; 
(c) day-to-day fluctuations in the 
quality of the writing performance; 
(d) systematic variation among judges 
in the emphasis given various aspects 
of quality; and (e) systematic variation 


among judges in the overall leniency 
with which they judge writing samples. 
The experimental design used in this 
study yielded separate estimates of the 
last three components of error variance 
and a combined estimate of the first 
two. While independent estimates of 
the first two components could have 
been obtained, the additional labor this 
would have required of the judges did 
not seem justified. 


RESULTS 


The analyses of variance for Grades 
1 and 2 are presented in Tables 1 and 
2; the estimates of the several variance 
components for each grade are given 
in Table 3. The types of factors repre- 
sented by each component are sum- 
marized below. 

o*.: Variation in handwriting quality 
within a sample taken on a given 
day. Also reflected is error of 
measurement arising from sub- 
jectivity of the judgment process. 


Day-to-day fluctuations by in- 


dividual subjects, beyond that 
attributable to o*,. (This factor 
is referred to as day xX subject, 
D x S, interaction. 


Inconsistencies among judges in 
their ranking of all pairs of sub- 
jects, beyond that attributable to 
e. This component reflects in- 
consistencies arising from differ- 
ent subjective weightings of the 
elements of quality. (This fac- 
tor is referred to as judge x sub- 
ject, J x S, interaction.) 


ao 


Variations in harshness from one 
judge to another. 


True variation among experimen- 
tal subjects; score variation aris- 
ing from sources other than er- 
rors of measurement. 
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TABLE 1 


ANALYSIS OF VARIANCE OF QUALITY SCORES 
oF GRADE 1 





Source df 
Subject (S) 
Day (D)* 
Judge (J)” 

» ><. 3° 


te 
oS 


MS 
290.4425 
1.7175 10 
155.1575 17.11* 
5.3875 1.23 


16.4600 
9.0675 
4.3700 


DxS 
Jxs 
oe De 


=u uM 
oak NM & 


_ 
toa) 





® Fp = MSv/MSp«x:s. 

> Fx = MS3/MSi x2. 

© Foxs = MSpx3/MSpx 3x. 
*p< Ol. 


TABLE 2 


ANALYSIS OF VARIANCE OF QUALITY SCORES 
or GRADE 2 


Source df 
Subject (S) 28 
Day (D)* 2 
Judge (J)" 2 


200.9175 
28.5175 
165.2600 
9.7575 


4 

56 22.1600 
11.9275 
6.8800 


* Fp = MSp/MSpv «xs. 

» Fy = MS1/MSixs 

© Foxy = MSp«3/MSp.« 
"a9 < Mi. 


TABLE 3 


ESTIMATES OF VARIANCE COMPONENTS FOR 
Qvatity Scores: GRADES 1 AND 2 


C : Estimate 
pa Grade 1 Grade 2 
Ce 4.3700 
. Ovxa 4.0300 
» Oixs 1.5650 
. ey 16.1200 
os 29.9200 
Standard error 3.1567 
of measurement 
(Components 1-3) 


6.8800 
5.0925 
1.6825 
16.7175 
19.3000 
3.6952 


A comparison of the four error vari- 
ance components reveals that judge-to- 
judge differences in overall harshness or 
leniency, as reflected in o*,, is by far the 
most serious. This is true in both grades 
studied. The error variance arising from 
the measuring procedure itself, o*,, is 
not extremely large, considering the mag- 
nitude of true subject differences, oy. 
Day-to-day fluctuations by individual 
subjects, o*p.s, represent about as potent 
a source of error as the within-day, 
within-judge component. The judge-to- 
judge inconsistencies, on the other hand, 
were rather small. The size of o°;,,, 
relative to o*,, suggests that judges were 
weighting the several aspects of the 
writing in much the same way in arriv- 
ing at their judgments. The error com- 
ponents for Grades 1 and 2 are fairly 
consistent in size. It appears that judges 
encountered no greater difficulty in ap- 
plying one grade scale than the other. 
There did appear to be greater hetero- 
geneity in writing quality within the 
first grade population than within the 
second grade population. This inference 
can be drawn from the comparative size 
of o*g for the two grade groups. 

A number of reliability coefficients 
may be derived from these components, 
just as there are a number of correlation 
approaches by which test reliability may 
be approximated. The essential differ- 
ences among the coefficients derived by 
either method arise from differences in 
the handling of the individual error com- 
ponents. For example, one might use two 
judges to score a single set of papers and 
correlate the results. Or one might have 
the first judge score writing samples col- 
lected on Tuesday and the second judge 
score a new set of samples collected 
from the same children on Thursday. 
The correlation between the latter pairs 
of judgments would almost certainly be 
lower than the first correlation, since 
the day-to-day fluctuations of the pupils 
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are reflected in the second coefficient and 
not in the first. A number of reliability 
coefficients for each grade are reported 
in Table 4. The coefficients were de- 
rived using formulas presented in Lind- 
quist (1953). To clarify the differences 
among these coefficients, each is de- 
scribed in terms of the hypothetical cor- 
relation coefficient to which it corre- 
sponds. It may be noted that several 
of these coefficients involve reuse of 
the same writing samples or repeated 
judgments by the same judge. Such 
coefficients are based on a restricted 
definition of errors of measurement. 
Measurement authorities agree that the 
preferred reliability coefficient is that 
value which reflects the measurement 
error arising from all sources. Ac- 
cording to this principle, a coefficient 
such as the first one suggested above 
would be unsatisfactory. It tends to ex- 
aggerate the reliability of the evaluation 
process, since it ignores sources of error 
which have actually affected the chil- 
dren’s scores. If all error sources are 
taken into account, the fourth coefficient 
represents, for a single measure, the most 
realistic appraisal of reliability. 


DISCUSSION 


By present-day achievement test 
standards, the reliability coefficients re- 
ported in Table 4 are rather low. Those 
for Grade 2 are particularly disappoint- 
ing. The standard error of measure- 
ment, even when variation in judge 
harshness is excluded, amounts to over 
half the difference between the scores of 
adjacent scale specimens. However, these 
data should not be taken as evidence 
that handwriting scales do not serve a 
useful purpose. Handwriting constitutes 
an important area in the elementary 
curriculum, and scales of this type ap- 
pear to provide the best means for hand- 
writing evaluation. Thus, one might 
look for ways in which reliability can 


rABLE 4 
ReviaBitity EstrmMates ror GRADES 1 AND 2 


Value for 
current study 


Grade1 Grade 2 





Correlation counterpart 


Scores assigned by Judge A851 740 
correlated with scores as 
signed by Judge B for the 
same writing samples. 

Scores assigned by Judge A 
on Sample 1 correlated with 
scores assigned by Judge A 
on Sample 2 taken several 
days later. 

Scores assigned by Judge A 
on Sample 1 correlated with 
scores assigned by Judge B 
on Sample 2 taken several 
days later. 

Scores assigned by Judge A 
on Sample 1 correlated with 
those assigned by Judge B on 
Sample 2 taken several days 
later; judge differences con- 
sidered as error of measure- 
ment. 

The mean of two 
(per pupil) taken on Day 1 
and evaluated by Judge A 
correlated with the mean of 
two scores taken on Day 2 
and evaluated by Judge B. 

The mean of three scores 
(per pupil) taken on Days 1, 
2, and 3 and evaluated by 
Judge A correlated with the 
mean of three scores taken 
on Days 4, 5, and 6 and eval 
uated by Judge B 


scores 


be raised, rather than rejecting the scale 
for its weaknesses. Two avenues can 
be suggested by which reliability may 
be improved: (@) averaging the scores 
from several independent testing ses- 
sions, and (6) providing additional train 
ing materials for teachers. 

The possibility of raising the relia- 
bility of quality scores by averaging the 
ratings for several independent samples 
is obvious. In view of the importance 
attached to evaluations of growth from 
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the beginning to the end of the.year, it 
would be highly desirable to secure an 
average based on at least three test writ- 
ings at each time of year. Additional 
steps could also be taken to improve 
the reliability of the scales. With more 
complete instructions on the emphasis 
to be given the various aspects of 
quality, teachers could probably de- 
velop more uniform standards in their 
judging. It seems likely that 2 or 3 
hours of practice by a school faculty, 
using samples of pupil writing projected 
on a screen, would pay substantial divi- 
dends in increased reliability. This prac- 
tice would be especially valuable for 
beginning teachers who have had little 
experience with handwriting scales. To 
reduce judge-to-judge differences in their 
overall leniency, publishers might con- 
sider supplying a set of practice speci- 
mens with each grade scale. These speci- 
mens, covering a suitably wide range of 
quality, could be reproduced on a set of 
cards, one sample to a card. On the re- 
verse side of each card could be noted the 


average score assigned by the publisher’s 
panel of judges. Such a practice deck 
could be quite helpful in bringing teach- 
ers closer together in their overall aver- 
age ratings. 

While this study has been concerned 
primarily with the issue of reliability, 


problems of validity clearly merit fur- 
ther investigation. The relationship be- 
tween handwriting scale scores and 
quality measures derived from everyday 
writing samples should be considered. 
The content of the test sentences should 
be carefully analyzed to determine if 
current scales include those letter com- 
binations, words, and numerals most 
sensitive to distortion of legibility. And 
of fundamental importance, of course, is 
the development of improved criterion 
measures of handwriting legibility. 
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FAILURE IMAGERY IN THE FANTASY OF 
EIGHTH GRADERS AS A FUNCTION OF 
THREE CONDITIONS OF 
INDUCED AROUSAL 


RICHARD C. ANDERSON 
Harvard University * 


An experiment was conducted in which 3 groups of 8th graders (total 
N = 143) completed a projective test under conditions of low-arousal, 
medium-arousal, and high-arousal respectively. Fear of Failure (f Fail- 
ure), as well as n Achievement, significantly discriminated the 3 condi- 
tions of induced arousal during projective testing. The medium-arousal 
group rather than the high-arousal group yielded the highest mean f 
Failure. It was argued that negative results in past research on f Failure 
may have resulted from anxiety-provoked repression of failure imagery 


under high-arousal conditions. 


The original research of the Mc- 
Clelland group (McClelland, Atkinson, 
Clark, & Lowell, 1953, 218-274) yielded 
evidence suggesting that there might be 
two recognizable aspects of achievement 
motivation, fear of failure (f Failure) 
as well as hope of success (n Achieve- 
ment). Subsequent work has sustained 
this belief (Clark, Teevan, & Ricciuti, 
1958). Nevertheless, in two investiga- 
tions using a full-blown measure of f 
Failure (Moulton, 1958; Raphelson & 
Moulton, 1958) the f Failure means of 
achievement-oriented groups were not 
significantly higher than the f Failure 
means of relaxed groups. The research 
described in this paper was designed to 
see whether a projective measure of 
f Failure would discriminate conditions 
of induced arousal and to explore the 
relationships of f Failure to self- descrip- 
tive measures of anxiety. 


PROCEDURE 


The research was conducted with a sample 
of 143 eighth graders attending suburban jun- 
ior high schools in the Boston area. The total 
sample was divided into three experimental 
groups. The intelligence test means of the 


1 The author is now with the East Bruns- 
wick Public Schools, New Jersey 
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three groups were not significantly different 
nor were the means on an index of socioeco- 
nomic status. The data of males and females 
were pooled since a pilot study yielded no 
significant sex differences. 

The experiment was designed in such a way 
that projective protocols were elicited from 
three groups of subjects under three different 
conditions of arousal. All subjects performed 
on the first form of a digit-letter task under 
what might be called “neutral conditions.” 
Following performance on the first form of 
the digit-letter task all groups were told that 
they had just done very poorly on a test of 
intelligence which was related to college suc- 
cess, driving skill, and being a good secretary 
At this point, the second form of the digit- 
letter task was administered. One of the three 
experimental groups took the projective test 
just prior to the first form of the digit-letter 
task, under what can be called “low-arousal 
conditions.” The test was introduced without 
special fanfare. Instructions contained no ref- 
erence to “test.” A second experimental group 
received the projective test under “medium- 
arousal conditions.” The projective test was 
given prior to performance on the first form of 
the digit-letter task and was preceded by an 
“explanation” of the research in which the 
experimenter stated that he was “trying to find 
out how to make school more interesting and 
valuable” for bright, successful students and 
those who are “kind of stupid.” The projective 
test was introduced as a “test of imagination” 
and the instructions contained several refer- 
ences to test. The “high arousal” group took 
the projective test following the second form of 
the digit-letter task. Once again the experi- 
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menter indicated that the subjects had done 
poorly on a test of important abilities. He 
then stated that he would give them the op- 
portunity to redeem themselves on a test of 
imagination. 

Four pictures, flashed on a screen in front 
of the room, were used to elicit projective pro- 
tocols. The pictures can be briefly described as 
follows: 

1. A Negro boy in an integrated class is 
raising his hand. 

2. A boy leaning against a light pole is talk- 
ing with a girl standing by a car. 

3. In the foreground, a man and a boy kneel 
on the floor together, apparently doing home- 
work, working on a puzzle, or playing a game. 
Two women and a baby are on a sofa in the 
rear. 

4. A group of teenage boys and girls are sit- 
ting in a semicircle. They appear to be listen- 
ing to what a boy in the foreground is saying. 


Following usual procedure (Atkinson, 1958, 
p. 837), the four standard questions were 
printed on projective answer sheets to struc- 
ture the subjects thinking slightly. 

The scoring conventions recently revised by 
the McClelland group (McClelland, Atkinson, 


ANDERSON 


Clark, & Lowell, 1958) were used in determin 
ing n Achievement. These conventions were 
supplemented with the category Explicit Eval- 
uation which Ricciuti and Clark (1957, p. 23) 
found to be a particularly good predictor of 
the presence or absence of achievement mo- 
tivation. A revised and extended version of 
Moulton’s schema was employed in scoring 
fear of failure. The chief revision was the add- 
ing of the “negative” categories (I—, Ga—, 
Bp, Bw, G—), which appeared in stories 
otherwise manifesting positive achievement 
motivation, to the f Failure score. Table 1 
contains a brief summary of the f Failure 
scoring conventions used in the present investi- 
gation. For illustration, f Failure scoring has 
been spelled out in detail for the following 
story of an eighth grade boy 


A teenage boy is called on in school and 
everyone is awaiting the outcome of his 
answers—the boy wasn’t paying attention 
so everybody looks at him—they think the 
boy is stupid! The boy wants the answers 
from one of his classmates—the boy will get 
bawled out by the teacher and may be 
kicked out of class 


TABLE 1 


SUMMARY OF F FarLure Scorinc System 


Scoring 


Name of category wolaht 


Failure imagery (FT) 


Doubtful failure imagery (DF1) 


Need to escape achievement situation 
(Ne) 


Brief definition 


Failure imagery is scored when there is 
(a) any indication of poor performance in cus- 
tomary achievement situation, (6) any doubt 
of ability to reach achievement goal, (c) any 
anticipation of failure, (d) any indication of 
discouragement or negative affect at the pos- 
sibility of achievement evaluation, (e) an 
expressed preference to leave an achievement 
situation or situation in which achievement 
evaluation might take place, (f) daydreaming 
in an achievement situation unless the subject 
of the daydream is of successful achievement 


Need to escape is scored when (a) a desire 
to leave an achievement situation is expressed 
or (b) a preference is expressed for another 
type of activity in an achievement situation, 
or (c) a desire for someone else to help or 


takeover in an achievement situation is ex- 


pressed 
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TABLE 1—Continued 


Scoring 


- Brief definition 
weight 


Name of category 


Instrumental failure ® (I Instrumental failure is scored whenever some 
physical or mental act designed to reach a pos 
tively cathected achievement goal fails 


Instrumental escape (le Instrumental escape is scored whenever a 
character acts to avoid or escape an achieve- 
ment situation. 


Anticipation of failure*® (Ga Anticipation of failure is scored whenever a 
character doubts he will succeed or predicts 
failure. 


Block in world to achievement success * Scored whenever a physical object, turn of 
(W—) events, or other person stands in the way of 
attaining an achievement goal 


Block in person to achievement succes Scored whenever some personal characteris 
(P—) tic, such as negligence or laziness, stands in 
way of attaining achievement goal 


Block in world to escape from achiev Scored whenever some physical object, turn 
ment situation (We) of events, or other person stands in way of 
escaping from achievement situation 


Block in person to escape from achiev: Scored whenever some personal characteris- 
ment situation (Pe) tic stands in way of escaping achievement 
situation. 


Negative affect at failure * (G Negative affect is scored when (a) someone 
experiences a negative emotional state as th« 
result of poor performances or low evaluation 
(b) the objective consequences of failure fall 
into the lower 10% of possible outcomes. 


Hostile press (HosP a) Hostile press is scored whenever there 
is overt criticism of one character by another 
including predictions of failure, (6) ridicule or 
negative affect displayed toward character who 
fails 


Negative evaluation (E Negative evaluation is scored whenever a 
character evaluates his own performance as 
inadequate. 


Failure thema (FTh Failure thema is scored whenever a story 
meets the definition of failure imagery, con- 
tains no reference to positive striving for 
achievement success, and contains no major 
subplots manifesting other types of motivation 


* Categories which are scored, when present, in stories showing achievement imagery (AI) but not failure imagery 
(FI). Otherwise none of the categories is scored unless FI is first scored 
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Failure imagery—There are several obvious 
grounds for scoring fear imagery. Having 
scored fear imagery it is possible to score 
other failure categories if they are present. 
Scored 2. 

Need to escape achievement situation—‘“the 
boy wants the answers from one of his class- 
mates.” Scored 1. 

Block in person to achievement success— 
“the boy wasn’t paying attention.” Scored 1. 

Hostile press—“everybody looks at him .. . 
they think the boy is stupid . . . the boy will 
get balled out by the teacher.” Scored 1. 

Negative affect at failure—By analogy to 
the conventions developed in connection with 
n Achievement (McClelland et ai., 1958, 
p. 200), if the objective state of affairs accom- 
panying failure falls into the lower 10% of 
possible outcomes for that type of activity G— 
is to be scored. Applying this convention to 
the present story, it does seem that the conse- 
quences of failing to know an answer were 
especially harsh. Scored 1. 

Failure thema—The story concerned failure 
and only failure, thus FTh must be scored. 
Scored 1. 

Summing the scores assigned for individual 
categories, the subject received a f Failure 
score of 7 for this story. His total score con- 
sisted of the sum of his scores on all stories. 

Two raters, with previous experience scoring 
n Achievement, independently scored the sto- 
ries collected in the present investigation when 
they had reached a criterion of agreement 
(rho = .90) on sets of practice stories. All 
scoring was done without knowledge of which 
experimental group the subject was from, what 
score the subject had obtained on other stories, 
or any other information about the subject. 
One rater scored all of the stories written to 
the first and third pictures. The other scored 
the stories written to the second and fourth 
pictures. For reliability purposes, both raters 
scored all the stories of 40 randomly chosen 
subjects. Rank-order correlations of .76 and 
89 were obtained between n Achievement 
scores and f Failure scores of the two raters, 
respectively. Stories written to one picture were 
discarded when it was discovered that only 3 of 
143 protocols contained either achievement or 
failure imagery. So each subject’s total n 
Achievement or f Failure score represented the 
sum of his scores on three stories. The distri- 
butions of raw n Achievement and raw f 
Failure scores were both positively skewed. 
The transformation Vx +3 was applied to n 
Achievement scores while the transformation 


Vx + 1 was applied to f Failure scores. These 
transformations were applied before testing the 


ANDERSON 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF PROJEC- 
TIVE MEASURES FOR THREE EXPERIMENTAL 
Groups 


High- 

arousal 

group 
(N = 45) 


Medium- 
arousal 
group 

(WH = 51) 


Low- 
arousal 
group 
; = 47 ) 


Projective 
measure 
CN 
n Achievement 
M 
SD 
f Failure 
M 
SD 
Note.—Statistics in this table are based on trans- 


formed data. Transformations of vx +3 and vx+1 
were applied to n Achievement and f Failure, respec- 
tively 


significance of differences among means, or 


computing correlations with other variables. 

All subjects completed a battery of six self- 
descriptive measures which included the Mani- 
fest Anxiety scale (Taylor, 1953) and a test 
anxiety scale 


RESULTS AND DISCUSSION 


The n Achievement and f Failure 
means and standard deviations of the 
three experimental groups appear in 
Table 2. Bartlett’s test indicated that 
the variances of the three groups were 
not significantly different on either of 
the projective measures. Analysis of 
variance (see Tables 3 and 4) showed 
the n Achievement means and the f 


TABLE 3 


ANALYSIS OF VARIANCE OF N ACHIEVEMENT 
MEANS OF THREE EXPERIMENTAL GROUPS 
Source SS 
6.41 
120.02 


Between 
Within 
Total 


3.21 
££ {i 
86 





InpucEeD AROUSAL OF FAILURI 


Failure means of the three groups sig- 
nificantly different at the .05 and .01 
levels, respectively. The total-sample 
correlation between n Achievement and 
f Failure was .19. 

In the case of n Achievement, the 
means of both the high-arousal and 
medium-arousal groups were signifi- 
cantly higher than the mean of the low- 
arousal group. The high-medium dif- 
ference was not significant. 

The f Failure mean of the medium- 
arousal group, but not the high-arousal 
group, was significantly higher (p < 
.001) than that of the low-arousal group. 
The f Failure mean of the medium- 
arousal group was higher than the f Fail- 
ure mean of the high-arousal group, but 
when a two-tailed test was applied the 
difference fell short of customary levels 
of significance (.10 > p > .05). Why 
did the medium-arousal group display 
the most failure imagery? The high- 
arousal group had just “failed” on an 
“intelligence test” which the group was 
told was related to getting into college, 
being a good secretary, and driving a 
car. Yet the medium-arousal group, in 
which achievement was made salient in 
a much less forceful and personal way 
showed a higher f Failure mean than 
the high-arousal group. Conceivably, 
conditions of high-arousal provoke 
anxiety about failure and result in a 
repression of failure imagery among 


TABLE 4 


ANALYSIS OF VARIANCE OF THE F FAILURE 
MEANS OF THREE EXPERIMENTAL GroUPS 


Source 


Between 
Within 
Total 


IMAGERY 

some subjects. In a less _ personal 
achievement-relevant situation—such as 
the medium-arousal condition—anxiety 
is not aroused to such a degree and de 
fenses are relatively inoperative, so con 
siderable failure imagery does appear. 
In a low-arousal situation where there 
are few achievement-related cues little 
failure imagery appears. 

In short, a curvilinear relationship is 
postulated between degree of arousal 
in the projective testing situation and 
amount of failure imagery appearing in 
projective protocols. This hypothesis 
may account for the fact that the 
medium-arousal group showed the high- 
est mean f Failure in the present experi- 
ment and may also account for the nega 
tive results of previous research (Moul- 
ton, 1958; Raphelson & Moulton, 1958), 
if it is assumed that the more arousing 
of the experimental conditions in these 
studies provoked anxiety about failure 

It is argued that the high-arousal 
group displayed less f Failure than the 
medium-arousal group, and only slightly 
more than the low-arousal group, because 
of a defensive reaction to an anxiety- 
provoking situation. The pattern of re- 
lations f Failure from the various arousal 
groups showed with the two self-descrip- 
tive anxiety scales is congruent with 
this interpretation. High-arousal f Fail- 
ure correlated —.11 with the Manifest 
Anxiety scale and —.22 with the Test 
Anxiety Scale. In contrast, low-arousal 
f Failure correlated .18 and .19 with 
the Manifest Anxiety scale and the Test 
Anxiety Scale, respectively. The parallel 
coefficients for the medium-arousal group 
were .03 and .10. None of these corre- 
lations alone was significantly different 
from zero, nor were differences between 
any of the correlations significant. How- 
ever, the direction of the coefficients is 
congruent with the hypothesis that 
anxious persons in the high-arousal 
group repressed failure imagery. 
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These results are consistent with those 
reported by Raphelson and Moulton 
(1958). In their first experiment, a red 
light flashed when subjects made errors 
on a pursuit rotor, which they frequently 
did. Fear of Failure, scored from a 
projective test administered immediately 
after performance on the pursuit rotor, 
correlated .25 with the test anxiety 
questionnaire. In the second experi- 
ment the flashing light was omitted. 
Raphelson and Moulton argued that the 
lack of information in the second ex- 
periment increased uncertainty about 
performance, and that “this perceived 
uncertainty” increased “the amount of 
aroused anxiety in all Ss.” In the 
second experiment, f Failure correlated 
—.38 with the test anxiety question- 
naire. The difference between the cor- 
relations obtained in the two experi- 
ments was significant at the .05 level. 
Raphelson and Moulton interpreted the 
negative correlation obtained in the sec- 
ond experiment as meaning that highly 


anxious subjects were defensive or “left 
the field psychologically.” The similarity 
in findings of the present investigation 
and the Raphelson-Moulton studies, de- 
spite different experimental conditions, 
lends credibility to the results and their 
interpretation. 


ANDERSON 
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ANXIETY, EXPERIMENTAL INSTRUCTIONS, 


AND THE WECHSLER ADULT 
INTELLIGENCE SCALE’ 
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SARASON ann JAMES MINARD 
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4 subscales of the WAIS were administered to 96 college students in 


an experiment in which there were 3 variables: 


(a) Ss’ Test Anxiety 


Scale scores, (b) sex of S, and (c) preliminary conditions (achievement- 


orienting and neutral instructions). 


On the Vocabulary, Block Design, 


and Comprehension subscales low test anxious Ss were superior to high 
test anxious Ss. For Comprehension, this effect was largely attributable 


to the effects of instructions 


High and low test anxious Ss did not 
differ on Digit Symbol, but a Test Anxiety 


< Sex X Instructions inter- 


action was obtained for this subscale. The results were interpreted as 
indicating the relevance to performance of the 3 variables studied as well 
as the importance oi the task variable in research and anxiety. 


The present paper reports evidence 
relevent to two questions: Do subjects 
differing in test anxiety differ in their 
performance on an individually admin- 
istered standardized intelligence test, a 
short form of the Wechsler Adult In- 
telligence Scale (WAIS)? Do experi- 
mental orienting instructions adminis- 
tered prior to performance differentially 
affect high and low test anxious subjects? 

The experiment performed was con- 
ceived in relation to certain empirical 
findings already reported (Sarason, 
1960). Available evidence has demon- 
strated a significant negative relation- 
ship between scores on the Test Anxiety 
Scale (TAS) and group administered 
indices of intelligence (Sarason, 1957, 
1959, 1961). There are also indications 
reported of differential influences of 
stressful and nonstressful instructions 
on the performance of subjects differing 
in anxiety (Lucas, 1952; Nicholson, 
1958; Sarason, 1956). These latter find- 
ings have come, however, primarily from 
verbal learning experiments. It seemed 
of value to determine the relationship 


1 This study was supported by a grant from 
the Agnes H. Anderson Research Fund 
2 Now at the University of Colorado 


of TAS scores to the individually ad- 
ministered WAIS and, also, to deter- 
mine the extent of the interaction be- 
tween TAS scores and instructions for 
that test. 


METHOD 
Subjects 


The subjects were 96 college students en- 
rolled in introductory psychology classes. They 
had been selected from a larger group of stu- 
dents, which had been administered the true 
false TAS (Sarason, 1959, 1960; Sarason & 
Palola, 1960). Half of the 96 subjects had TAS 
scores of 8 and above, and half had scores of 
O or 1. These cutoff points corresponded ap- 
proximately to the upper and lower 15% of 
the TAS distribution. The high and low TAS 
groups were selected so that each was com- 
posed equally of males and females 


Procedure 

Each subject was individually administered 
four subtests of the WAIS (Wechsler, 1955) 
Two Verbal scale subtests, Vocabulary and 
Comprehension, and two Performance scale 
subtests, Block Design and Digit Symbol, were 
the tasks employed. Except for the experi- 
mental preliminary instructions, the testing 
procedures outlined in the WAIS manual were 
followed. 

The order of presentation of the four sub 
tests was handled in the following manner 
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Half of the subjects were administered the two 
Verbal tests first, while the remaining subjects 
were administered the Performance tests first. 
Order of administration within the two Verbal 
and within the two Performance tests was 
randomized. In order to determine whether or 
not order of administration of Verbal and Per- 
formance subtests influenced the subject’s 
WAIS scores, F tests were performed. The 
results showed that there was no significant 
order effect. 

Half the subjects received the following 
achievement-orienting instructions at the begin- 
ning of the experiment: 


In this experiment I want to make an evalua- 
tion of your intellectual level. Although most 
students in college are above average intelli- 
gence there are some who aren’t any better 
than average in ability. By using the scales 
of the Wechsler Adult Intelligence Test, we 
will be able to pinpoint those students who 
are below the level of intelligence needed for 
successful work in college. 


The other subjects received the following 
more neutral instructions: 

I'd like you to help me in this way. These 

are some tasks I’m getting practice in ad- 

ministering. They start with easy items and 

work up to more difficult ones 


RESULTS 


Three factor analyses of variance (12 
subjects per cell) were performed using 
subtest raw scores and the scaled score 
sums as the measures. Table 1 presents 
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the means for the scaled score sums and 
for the subtest raw scores. When the 
sum of the scaled scores was used as 
the measure, only one F was found to 
be statistically significant. Low TAS 
subjects obtained higher scores than 
did high TAS subjects (p < .05). 
Analysis of various of the subtest raw 
scores for Vocabulary yielded the same 
results (p < .05). In both these anal- 
yses, the superiority of low to high TAS 
subjects was as great under the neutral 
as under the achievement-orienting con- 
dition. 

For Comprehension, the low TAS 
group performed at a higher level than 
did the high TAS group (p < .05). 
However, this result was due largely 
to differences under the achievement- 
orienting condition. Under this condi- 
tion, low TAS subjects were superior to 
high TAS subjects (p < .02), but 
under the neutral condition there was 
no significant difference. 

The Digit Symbol analysis did not re- 
veal any overall differences due to TAS 
or instructions. The one significant F 
involved the interaction of TAS scores, 
instructions, and sex (p < .05). This 
result was due to differences in response 
to experimental instructions of male and 


TABLE 1 


MEANs For SUM OF SCALED SCORES AND FoR SuBTEST Raw Scores 
(12 Ss per group) 


Scaled 


Group * 
Sores sum 


hension 


23.92 
23.25 
24.00 
24.33 
26.00 
25.42 
24.42 
24.42 


H-M-(A-O) 
H-F-(A-O) 
H-M-N 
H-F-N 
L-M-(A-O) 
L-F-(A-O) 
L-M-N 
L-F-N 


be 
oo 


YFOonnaAuUanaauwn 
FNADANVN 
~NOoOwhD iu 


ViM@ww bw «7 


* First designation characterizing each group refers to high (H) or low (L) TAS scores: 
refers to male (M) or female (F) subjects 
A-O) or neutral condition (N) 


Compre- 


Vocabulary Digit Symbol Block Design 


39.42 
36.33 
38.42 
26.17 
42.85 
37.92 
44.17 
37.08 


65.00 
65.42 


$5.75 


62.42 
61.92 
63.25 
62.00 68.67 
70.00 62.58 
64.00 5.75 
65.50 50 
67.08 3.50 


second designation 


and third designation indicates either achievement-orienting conditior 
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female high TAS subjects. Female high 
TAS subjects performed at a higher 
level under neutral than under achieve- 
ment-orienting conditions, whereas male 
high TAS subjects performed at a higher 
level under the achievement-orienting 
than under the neutral condition. 
There were two significant Fs in the 
analysis of Block Design scores. One 
was for sex (p < .01), with males su- 
perior to females; the other was for 
TAS (p < .05), with low superior to 
high TAS subjects. This difference be- 
tween high and low TAS subjects tended 
to be stronger for males than for females. 


DISCUSSION 

With respect to overall performance 
level, it seems clear that low test 
anxious subjects were generally superior 
on this short form of the WAIS to high 
test anxious subjects. This result con- 
firms previous findings using group ad- 
ministered measures of intelligence 
(Sarason, 1957, 1959). How may these 
results be interpreted? Two alternatives 
come to mind. Perhaps TAS is simply 
a measure of intelligence. On the other 
hand, it might be considered as a meas- 
ure of a nonintellective, personality 
variable which, with intelligence, influ- 
ences performance. Obviously the latter 
hypothesis is, psychologically, the more 
interesting and potentially more fruit- 
ful conception. Supporting this hy- 
pothesis is evidence from learning ex- 
periments which has indicated that 
under certain instructional conditions 
low anxious groups are superior to high 
anxious groups, and under other con- 
ditions there are no significant differ- 
ences (Sarason, 1956, 1958). This sug- 
gests that ability, per se, may not be 
the basis for observed differences be- 
tween high and low anxious groups; and 
that these groups differ in their response 
to the variable of instructions. 

One way of viewing TAS scores is to 
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assume that they reflect the tendency 
of the individual to show deterioration 
in performance under personally threat- 
ening or stressful conditions which are 
associated with performance. Or, as 
Atkinson (1960) has suggested, anxiety 
scores may reflect varying strengths of 
the need to avoid failure. This interpre- 
tation would lead one to view experi- 
mental instructions and conditions as 
important factors in the arousal of the 
need. If this type of assumption is cor- 
rect it might, then, be hypothesized 
that (a) under such conditions, high 
TAS subjects would be inferior in per- 
formance to low TAS subjects, but that 
(6) under more neutral conditions, there 
would be no difference attributable to 
TAS score differences. 

Of the four tasks used in the present 
experiment, this hypothesis was sup- 
ported in the case of Comprehension, 
but not in the cases of Vocabulary and 
Block Design. Support for the hy- 
pothesis of a relationship between 


anxiety and experimental instructions 
was obtained .for Digit Symbol scores, 
but within the context of an interaction 
more complex in nature than would be 
encompassed by the conception sug- 


gested above. For Digit Symbol, the 
TAS-instructions interaction was com- 
plicated by the variable of sex of the 
subject. For females, as would be pre- 
dicted, high TAS subjects performed at 
a higher level under achievement orien- 
tation than under neutral conditions. 
However, the reverse situation, obtained 
for high TAS males, had not been ex 
pected. 

The lack of significant simple rela- 
tionships for anxiety and instructions 
for Digit Symbol and the obtained 
triple interaction is interesting in the 
light of previous research in which these 
variables were found to interact signifi- 
cantly with a third variable. Sarason 
and Palola (1960), using a digit sym- 
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bol task, found that TAS and instruc- 
tions interacted significantly with task 
difficulty. The present results, together 
with evidence already reported, suggest 
that the relationship between anxiety 
and experimental instructions may be 
importantly influenced by the nature of 
the task on which the subjects perform, 
the difficulty of that task, and, at least 
one subject characteristic, sex. The na- 
ture of the task seems to be particu- 
larly important, since the degree of ef- 
fectiveness of experimental instructions 
which describe the task as a measure of 
intelligence probably depends on the ex- 
tent to which the task -has face validity 
for the subject as a measure of intelli- 
gence. Both instructions and the task 
can be viewed as inducing varying de- 
grees of motivation. Research incor- 
porating these, and other potentially 
relevant factors, such as characteristics 
of experimenters and the particular anx- 
iety instrument employed, should prove 
valuable in clarifying the relationship 
of anxiety to performance. This sort of 
research would also be relevant to the 
clinical testing situation since it would 
shed light on the complex question of the 
optimum method of presenting or in- 
troducing psychometric instruments to 
individuals. 
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TO SEEK ADVANCED TRAINING: 
ANOTHER LOOK’ 
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A recent study (Thistlethwaite, 1962), in which certain college “presses” 
were interpreted as influencing the student’s motivation for advanced 
training, was re-examined. The method used for controlling the student’s 
initial educational level of aspiration—the analysis of covariance—was 
thought to be inappropriate, primarily because of anomalies in the corre- 
lation between initial and final levels of aspiration. In addition, the 
estimate of initial level of aspiration, which had been obtained retro- 
spectively from the Ss, was judged to be too undependable for use as a 
measure of initial level of aspiration. Reanalyses of some of the data 
failed to support the author's original conclusions about the “effects” of 
the press on student motivation. Certain methodological problems in 
studying the influence of college environments on the student were 


discussed. 


In a recent paper Thistlethwaite 
(1962) reported an extensive series of 
analyses which appear to demonstrate 
‘the influence of faculty and student 
“presses” on the college student’s edu- 
cational plans. He concluded from these 
analyses that the male student's level 
of aspiration (LOA, highest degree 
planned) is increased during the first 
2 years of college by high faculty 
presses for “independence,” “suppor- 
tiveness,” and “affiliation”; low faculty 
press for “vocationalism’’; high student 
press for “openness to faculty influ- 
ence’; and participation in honors pro- 
grams. The principal justification for 
these interpretations was that the anal- 
yses had ruled out certain “rival hy- 
potheses” (i.e., rival to the “environ- 
mental effect” hypothesis), namely, the 
student’s sex, major field, and level of 


1 This paper is a part of the research program 
of the National Merit Scholarship Corpora- 
tion and was financed by a grant from the 
National Science Foundation. The author is 
indebted to J. L. Holland for his critical read- 
ing of the manuscript. 


aspiration at the time he enters college.* 
However, a re-examination of the evi- 
dence suggests that these conclusions 
are vitiated by inaccuracies in the 
student reports of initial LOA and by 
the method of analysis which was used. 
METHOD 


APPROPRIATENESS OF THE 


The principal statistical method used 
by Thistlethwaite was the analysis of 
covariance. Posttest LAOs (after 2 
years of college) for various “press” 
groups were adjusted in terms of pre- 
test LOAs (obtained retrospectively). 
Of the several assumptions required by 
the analysis of covariance, the nature 
of the regression between the control 


2It should be noted that these rival hy- 
potheses, which Thistlethwaite attributes to the 
present writer (Astin, 1961) and others (Hol- 
land, 1957; Stuit, Helmstadter, & Frederiksen, 
1956), were not proposed with respect to the 
“student’s stated level of aspiration after two 
years of college.” On the contrary, these hy- 
potheses were proposed as alternatives to an 
environmental effects interpretation of a quite 
different dependent variable: the proportion of 
a college’s graduates who eventually obtain the 
PhD. 
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and dependent variables is of special 
importance. Figure 1 shows the distri 
bution of Posttest LOAs for each level 
of Pretest LOA using Thistlethwaite’s 
male subjects. (The lowest level, “less 


Felis 





——— OS 


(N= 128) 


Professional 
(MD, LLB, etc.) 














PhD 
(N=225) 














Master's 
(N=112) 











© 
2 
rv) 
® 
a 
2 
2 
° 
a 
c 
2 
) 
2 
a 
o 
a 
~ 
° 
> 
> 
® 
a 
3 
£ 








Bachelor's 
(N=140) 


Number of subjects 
in each interval 


Professional 
(AAD, LLB, etc.) 
Bachelor's 


Final Level of Aspiration 


Fic. 1. Final LOA after 2 years of college 
as a function of retrospective report of initial 
LOA at time of entrance to college. (Male 
subjects from Thistlethwaite, 1962) 
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than bachelor’s degree,’ has _ been 
omitted because only one subject gave 
this as his Pretest LOA.) An inspection 
of Figure 1 indicates that the regression 
is heteroscedastic. This observation is 
supported by Bartlett’s test, which is 
significant (p < .01). In fact, four of 
the six possible pairs of column vari- 
ances are heterogeneous at the .01 level. 
In addition to heteroscedasticity, several 
of the arrays in Figure 1 are markedly 
skewed, and two of the arrays appear 
to have bimodal distributions. 

In view of these deviations from the 
conditions necessary for employing the 
Pearson 7, it would seem that control- 
ling initial LOA by means of the anal- 
ysis of covariance is inappropriate. In 
addition, since initial LOA was obtained 
from the subject concurrently with a 
report of the press, the possibility exists 
that the independent variable (press) 
has affected the control variable, initial 
LOA, (a discussion of how this might 
occur follows later). Such an effect 
would also invalidate the analysis of 
covariance. 

If one were to overlook the anomalies 
in Figure 1, the use of covariance analy- 
sis in this situation can still be ques- 
tioned on the grounds that similar “ad- 
justed posttest” LOAs are in some cases 
based on qualitatively different events. 
For instance, the student who changes 
from PhD to the MD degree receives 
approximately the same adjusted post- 
test LOA as the student who changes 
from the MA to the BA degree. A re- 
lated question is whether the MD or 
LLB actually represent “lower” levels 
of aspiration than the PhD. Does the 
student who abandons his plans for the 
MD and pursues instead the PhD in 
biology really “raise” his level of as- 
piration? The MD is no lower in terms 
of the amount of formal training re- 
quired, and a good case can probably 
be made for the point of view that the 
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MD has more prestige associated with 
it than almost any other degree. In 
terms of the academic degree hierarchy, 
the MD and LLB rank with the bac- 
calaureate degrees; but to follow this 
ordering would be to treat any profes- 
sional degree as a lower LOA than even 
the master’s degree. Perhaps the most 
defensible ordering, in terms of status 
and length and difficulty of training re- 
quired, would be to combine the PhD 
and professional degrees into a single 
(“highest”) category. 


OF THE RETROSPECTIVI 
REPORT 


ACCURACY 


All analyses of the effects of differ- 
ent presses were based on retrospective 
reports by the subjects of their LOAs 
at the time they entered college. While 
it was acknowledged that the true ini- 
tial LOA “possibly would not cor- 
relate perfectly with retrospective pre- 
tests” (Thistlethwaite, 1962, p. 54), 
certain other analyses recently com- 


pleted with another sample suggest that 
such retrospective reporting is highly 
inaccurate. These analyses, based on a 
large sample of 1957 Merit Finalists 
and Commended Students, are shown in 


Table 1. These students were asked 
their LOA shortly after they entered col- 


lege in Fall 1957, and again as they 
were completing college in 1961. In the 
1961 survey they were also asked to re- 
call what their LOA had been in 1957. 
From Table 1 it is clear that errors in 
the retrospective report are common. 
There is a systematic tendency for both 
males and females to underestimate 
their initia! LOA, although overestima- 
tion also occurs with some frequency. 
Errors in recall vary with each actual 
level in 1957, the most frequent occur- 
ring at the master’s level, where only 
33% of the students who initially chose 
the master’s level in 1957 “remembered”’ 
in 1961. Accuracy of recall among 
students who initially chose the bache- 
lor’s or doctor’s degree in 1957 was 
79%. 

A further analysis of the subjects in 
Table 1 showed that among the 1,605 
males who changed LOA between 1957 
and 1961, only 45% recalled their 
initial LOA correctly. The accuracy of 
recall for the 2,336 male nonchangers 
was 79%. Accuracy of recall was 57% 
for the 874 female changers and 70% 
for the 1,029 female nonchangers. A 
possible explanation for this differential 
accuracy between changers and non- 
changers is that when the subject is un- 
sure about what his initial LOA really 


TABLE 1 


ACCURACY OF THE RETROSPECTIVE REPORT 


Actuat LOA tn 1957 CoMPARED WITH RETROSPECTIVE 


Report 4 YEARS LATER 


Retrospective report of 1957 LOA 


Less than 


Actual LOA in 1957 bachelor’s 


Males 


Doctor’s (PhD 
MD., etc.) 

Master’s 

Bachelor’s 

Less than bachelor’s 
Totals 
Percentage of 

rect recall 


Bachelor's 


Females 


Doctor's 
(PhD, MD, etc.) 


Males 


Totals 


Males Females 


Master’s 


Males Females Females 
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was, he tends to give his current LOA. 
Whenever this happens the nonchanger 
would be “right” and the changer 
“wrong.” 

Although it is not possible to check 
on the accuracy of recall of LOA for 
all of the subjects used by Thistle- 
thwaite, some of his sample does overlap 
with another sample of Merit Finalists 
(Holland, 1960) on which LOA was ob- 
tained immediately prior to their enter- 
ing college in 1958. Since Thistle- 
thwaite’s positive results with environ- 
mental effects were obtained with male 
subjects,* the following analyses are 
confined to the 150 male subjects who 
overlapped in the two samples. Table 2 
shows the accuracy of the retrospective 
report of LOA obtained from these sub- 
jects by Thistlethwaite. The pattern of 
errors in Table 2 is very similar to that 
obtained with males in the much larger 
sample (Table 1), even though the in- 
terval of time was only 2 (rather than 


8 Thistlethwaite does mention one significant 
environmental effect on LOA among women: 
the Faculty Press for Humanism was negatively 
related to increased LOA. This finding was 
exactly the opposite of the effect of the Hu- 
manism press reported in a previous study 
(Thistlethwaite, 1960). Both of these studies, 
however, used retrospective reports of initial 
LOA. 


4) years. The slightly greater rate of 
error for Thistlethwaite’s males may be 
due in part to the use of the additional 
level (professional degrees). Once again, 
the changers (V=57) showed a much 
lower accuracy of recall (26%) than 
the nonchangers (81%). 

It seems clear from these results that 
retrospective reports of LOA, even after 
only 2 years, are undependable. The 
rate of accuracy for the subjects who 
actually do change LOA is essentially 
chance. Assuming that mo subjects 
changed LOA between 1958 and 1960 
results in fewer errors (62% accuracy) 
than were involved in the retrospective 
reports (60% accuracy). 

In short, the evidence suggests that 
retrospective reports of initial LOA are 
not sufficiently accurate to use as sub- 
stitutes for true initial LOA. Accord- 
ingly, this and other studies (e.g., 
Thistlethwaite, 1960) which have uti- 
lized such retrospective reports require 
replication with the use of true initial 
and final LOAs before one can conclude 
that the college press affects LOA. 


REANALYSIS OF DATA 


Since there were 150 male subjects for 
whom initial statements of LOA in 1958 
were available, an attempt was made 


TABLE 2 


AccuRACY oF THE Retrospective Report For 150 MALE SusByects 
(Subjects from Thistlethwaite, 1962) 


Retrospective report of 1958 LOA 


Actual LOA in 1958 Bachelor’s 


PhD 11 
Professional (MD, LLB, etc.) 2 
Master’s 
Bachelor's 12 
Totals 35 
Percentage of correct 
recall 


Master’s 


Professional 


(MD, LLB, etc.) Totals 


PhD 


8 


1 
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to replicate Thistlethwaite’s positive 
findings regarding the effects of press. 
The first problem was to determine 
whether any of the relationships be- 
tween press and final LOA held up when 
initial (true) LOA was held constant. 
For this purpose three separate analy- 
ses were carried out. In the first anal- 
ysis all subjects whose initial LOA was 
at the bachelor’s level were divided 
into two groups: those who raised their 
LOAs between 1958 and 1960 and those 
who did not. For the second analysis, 
the subjects with initial LOAs at the 
master’s level were similarly divided 
into those who did not change and those 
who increased their LOAs. (In these 
first two analyses four subjects who 
lowered their LOAs, and two others who 
gave unclassifiable LOAs in 1960, were 
omitted. In addition, one subject, not 
shown in Table 2 because his retro- 


spective LOA was “unclassifiable,” was 
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added.) In the third analysis, all sub 
jects whose initial LOAs in 1958 were 
at the doctorate level (PhD, MD, etc.) 
were divided into two groups: those 
who maintained this LOA in 1960 and 
those who reported a lower LOA (ic., 
master’s or bachelor’s) in 1960. One 
subject whose 1960 LOA was unclassi 
fiable was omitted from this analysis.) 
The three groups of changers were then 
compared with their respective non- 
changers on each of the presses by means 
of ¢ ratios. Table 3 shows the results of 
these analyses. None of the 18 one- 
tailed ¢ tests in Table 3 reaches signifi 
cance. Exactly half of the mean differ 
ences are in the predicted direction, and 
the 2 largest ¢ ratios occur with mean 
differences which are actually in oppo 
site directions from what would be pre- 
Thistlethwaite’s conclu 


dicted from 


sions. 


TABLE 3 


REANALYSES OF THE Errects OF Press 
(145 male subjects from Thistlethwaite, 1962) 


Bache 

Me an 
for sub 
jects 
with 
higher 
LOA in 
1960 
V—14) 


Mean 
for non- 
changers 
(N—6) 


Faculty press 
Independence 
Supportiveness 
Affiliation (+ 
Vocationalism 

Student press: 
Openness to faculty 

influence (+) 

Honors program par- 
ticipation (+-)* 


* t.06 = 1.73, one-tailed test 
> ¢.06 = 1.72, one-tailed test 
© tf. = 1.66, one-tailed test. 
4 Direction of relationship betweer 
* The percentage of honors program partici; 


pnd tendency t 
ants in each group is 


¢) LOA in 1958 
Doctor’s 
PhD, MD, etc.) 


Mean 
for sub- 
jects 
with 


lower 


Master 
Mean 


for sub 
jects Mean 

with for non 

higher hangers 

LOA ir V—89) LOAin 
1960 1960 


(N=11) V=13 


Mean 
for non- 
changers 
(N=12) 


rted | 


raise LOA as reper 
shown 
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In view of these negative findings,* 
the question naturally occurs as to what 
factors were involved in the significant 
relationships reported by Thistlethwaite 
(1962) using retrospective LOA and 
the analysis of covariance. He found 
(p. 59) that covarying on _ initial 
(retrospective) LOA did not remove all 
of the variance associated with the sub- 
ject’s initial major field. As a means of 
further controlling this “input” variable, 
the subjects were then separated into 
science and nonscience majors. On this 
basis, the Faculty Press for Independ- 
ence was still significantly related to 
adjusted mean level of aspiration within 
the nonscience group. (Results of the 
science, nonscience analysis for the five 
other presses shown in Table 3 were not 
reported, so it is assumed that the rela- 
tionships disappeared with the addition 
of this third input variable.) However, 
certain other results (Thistlethwaite, 
1962) suggest that the division of sub- 
jects into science and nonscience groups 
may not have been a sufficient control 
for initial major field. It happens that 
two of the specific major field groups 
included in the nonscience classifica- 
tion, philosophy ani education, had, 
respectively, the highest and lowest ad- 
justed mean LOAs of all the specific 
major field groups (pp. 58-59). In 
Thistlethwaite’s (1962) Table 1 (p. 56), 
which shows the relationships between 
major fields and some of the presses, it 
can be seen that philosophy—the group 
with the highest adjusted mean LOA— 
also has one of the highest mean scores 
on Faculty Press for Independence. 
Similarly, education—the group with 
the lowest adjusted mean LOA—has 

* These reanalyses were performed with only 
24% of Thistlethwaite’s male sample. Although 
there is no reason to believe that the smaller 
sample is a biased one, the possibility remains 
that the results would have been different if the 
entire sample had been used. 


the lowest mean of all the nonscience 
groups on Faculty Press for Independ- 
ence. Perhaps the most likely expla- 


nation of the significant relationships 
involving this press is that there were 
still differences with respect to initial 
major field of study which had not been 
adequately controlled. 


DISCUSSION 


The present results suggest that the 
college press does not affect the stu- 
dent’s level of aspiration in the manner 
previously supposed. On the other hand, 
these findings should not be interpreted 
as evidence that the college environ- 
ment is unimportant. Indeed, on strong 
a priori grounds we should expect stu- 
dents to be affected differently by dif- 
ferent college environments. But since 
we cannot assign students randomly and 
manipulate their college environments 
directly, we cannot conclude that dif- 
ferential environment influences have 
been identified until we have adequately 
controlled those input variables which 
are known to be related to the student 
performance in question. 

The importance of such controls can 
hardly be overestimated. First of all, 
since the college press (Pace & Stern, 
1958) is closely related to certain char- 
acteristics of the student body (Astin 
& Holland, 1961), the effects of these 
presses cannot be properly assessed with- 
out first controlling for student input. 
Secondly, the control of student input 
should eventually lead us closer to what- 
ever “real” effects might exist. Recent 
research on the PhD “productivity” of 
colleges (Astin, 1962), for example, has 
shown that some of the colleges which 
were previously called “highly produc- 
tive” (Knapp & Greenbaum, 1953) turn 
out to be among the most “underpro- 
ductive” when certain aspects of their 
student inputs are taken into account. 
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A final methodological point re- 
garding studies of this type should be 
mentioned. If we seek to learn what as- 
pects of the college environment influ- 
ence the student, we should try to keep 
our measures of the environment and 
our measures of student performance 
experimentally independent. Thistle- 
thwaite (1962) speaks of “students ex- 
posed to some educational treatments” 
and of “teachers who exert press for 
independence” (p. 63), almost as if 
the press were an experimentally ma- 
nipulated independent variable. The 
fact is, that the press for any student 
was defined solely in terms of the stu- 
dent’s own verbal report. It is possible, 
for example, that students who regard 
their professors positively would be more 
inclined to attribute beneficial effects to 
these same professors than students who 
are prone to be critical of their profes- 
sors. To be sure, such effects might 
occur, but they will be difficult to dem- 
onstrate in any convincing fashion as 
long as the student defines his own 
status simultaneously on both the inde- 
pendent and dependent variables. 
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RIVAL HYPOTHESES FOR EXPLAINING THE 
EFFECTS OF DIFFERENT LEARNING 
ENVIRONMENTS * 


DONALD L. THISTLETHWAITE 
Vanderbilt University 


explanations advanced to show the artifactual nature of previously 
reported effects of different learning environments are critically examined. 
Longitudinal data on a sample of 2405 undergraduate men confirm and 
extend the previous findings and rule out interpretation of the findings in 
terms of retrospective errors of recall or in terms of alleged anomalies 
related to covariance analysis. Men who report that their teachers exert 
strong press for enthusiasm, humanism, affiliation, independence, achieve- 
ment, and supportiveness, or who exert weak press for compliance, tend 
to raise their aspirations for advanced training more than men not 


reporting such press 


In his paper Astin (1962) has ques- 
tioned whether my earlier report 
(Thistlethwaite, 1962) has provided 
evidence that college press influences the 
student’s motivation to seek advanced 
training. Astin proposes two hypotheses 
as alternative explanations for my re- 
sults: (a) the observed differences were 
spurious due to inaccuracies in the 
retrospective recall of degree aspirations 
at the beginning of college, (5) the ob- 
served differences were spurious due to 
an inappropriate application of covari- 
ance analysis. 

I shall show that neither of these 
rival hypotheses is plausible because of 
weaknesses in the argument and analy- 
sis presented by Astin and because 
neither receives support from more re- 
cent results obtained by the author. 
With regard to the first of Astin’s rival 
hypotheses I shall argue: (a) his data 


1The research briefly reported herein was 
supported through the Cooperative . Research 
Program of the Office of Education, United 
States Department of Health, Education, and 
Welfare. In addition, the collection of longi- 
tudinal data was partially supported by Na- 
tional Science Foundation, Old Dominion 
Foundation, and Ford Foundation grants to the 
National Merit Scholarship Corporation. 


do not provide convincing evidence for 
the inaccurracy of retrospective recall; 
(6) he has failed to show how alleged 
errors of recall could account for my 
earlier results; and (c) the alleged 
errors of recall are irrelevant in any case, 
since when they are ruled out the previ- 
ously reported effects are replicated. 

With regard to the second rival hy- 
pothesis I shall argue: (c) Astin has 
failed to show how my use of covariance 
analysis could have produced spurious 
results; (5) no valid inferences con- 
cerning this hypothesis can be drawn 
from the results of Astir.’s “reanalysis” 
(in Table 3 of his paper) because of 
sampling considerations; and (c) the 
results of a new analysis, based upon 
a large sample of undergraduates, pro- 
vide no support for it. 


ALLEGED Errors OF RECALL 


Anyone familiar with the mail sur- 
veys made of Merit Finalists by the 
National Merit Scholarship Corporation 
knows that the returns typically come 
back over a 6-month interval or more. 
Consequently precise localization of the 
time period to which student reports 
refer is impossible. However, such lo- 
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calization is critical for the kind of com- 
parisons Astin attempts to make. In de- 
scribing the procedures used in polling 
the 1957 Finalists and commended stu- 
dents, Astin (1962) states: 

These students were asked their LOA [level of 
aspiration] shortly after they entered college 
in Fall 1957, and again as they were complet- 
ing college in 1961. In the 1961 survey they 
were also asked to recall what their LOA had 
been in 1957 (p. 305). 

No matter how adroit the request for 
retrospective recalls in 1961, it is un- 
likely that each student was being asked 
to recall what his dispositions were dur- 
ing the same time interval to which his 
report in 1957 referred. In short, if 
there was any ambiguity regarding the 
specific point in time students were 
being asked to recall, then discrepancies 
of the kind Astin points to do not neces- 
sarily show errors of recall. Since there 
are ample grounds for assuming such 
ambiguity existed, it follows that Astin’s 
(1962) data summarized in Tables 1 and 
2 of his paper do not provide convinc- 
ing evidence of the unreliability of retro- 
spective reports.” 


IMPORT OF ALLEGED Errors or RECALL 


Even if we grant that Astin’s data 
raise questions about the reliability of 
retrospective reports, it does not follow 
that such unreliability could explain the 
observed differences. On the contrary, 


unreliability of measurement should 
work against the demonstration of the 
hypothesized effects. The more unre- 
liability in the data the harder it should 
be to demonstrate significant differences 
between press groups. If effects are ob- 


21 called this ambiguity to Astin’s attention 
in correspondence prior to the revision of his 
paper. Since he has not commented on the 
asynchronism in the objects of the “actual” 
and retrospective reports I assume this am 
biguity cannot be ruled out 


tained with unreliable measures, they 
should be even greater with more reliable 
measures. 

To be sure, Astin argues that the 
“changers” showed more inaccurate re- 
call than the “nonchangers.” However, 
my comparisons were not among groups 
dichotomized in this manner. Further- 
more, no attempt has been made by 
Astin to show how the alleged system- 
atic biases in recall could produce the 
“effects” observed in my earlier study. 


COVARIANCE ANALYSIS WITH ERRORS 


oF RecaLt Rutep Out 


Even though we cannot imagine how 
inaccurate recall could have produced 
spurious results in the earlier study, it 
is worth asking whether the “effects”’ 
of different learning environments would 
disappear if potential errors of recall 
were ruled out. Table 5 of my earlier 
report (Thistlethwaite, 1962) describes 
the observed effects when the covariate 
(initial LOA) is based upon data ob- 
tained by the method of retrospective 
reall. A new analysis in which the 
covariate (initial LOA) is based on 
“actual” measures obtained at the be- 
ginning of college is reported below. If 
the latter analysis fails to confirm the 
former, then we shall have evidence that 
errors of recall may have somehow spuri- 
ously produced the effects. On the other 
hand, if the latter analysis confirms the 
former, then we shail have evidence 
that alleged errors of retall were not 
responsible for the effects. 

Additional analyses, based on a sam- 
ple of approximately 2,400 men enrolled 
in 140 colleges and universities, are 
shown in Table 1. These students were 
asked their LOA shortly after they en 
tered college in Fall 1959, and again 
after they had completed 2 years of col- 
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TABLE 1 


COVARIANCE ANALYSIS FOR COMPARING CHANGES 
In ASPIRATION AMONG MEN EXPOSED TO 
DIFFERENT COLLEGE PREsS 


Press reported 

by group with 

higher adjusted 
final LOA 


Basis for classifying 
students into press 
groups * 


F ratio 


Faculty press 
Enthusiasm 
Humanism 
Affiliation (+-)” 
Independence (-++- ) 
Compliance 
Vocationalism ( — ) 
Achievement strong 
Supportiveness (-+-) strong 5.74* 
Directiveness 01 

Student press 
Estheticism 
Reflectiveness 


strong 
strong 
strong 
strong 
weak 
weak 


24.48** 
7.32** 


strong 
strong 


Social conformity 4l 
Competition 23 


* Results for comparisons based upon classification 
by faculty press for compliance and by student press 
for estheticism and reflectiveness should be interpreted 
with caution because of heterogeneity of regression 
among treatment groups 

> Direction of statistically significant relationships 
between press and increase in LOA found in earlier 
study. 

*p< .05. 

** p< Ol. 
lege, in June 1961.* Since these students 
were not asked to recall their earlier 
degree aspirations, but were asked about 
their current plans on two independent 
occasions, we can rule out errors of recall 
in the data. Revised forms of 9 of 
the 10 faculty press scales and 4 of the 
10 student press scales~used in the 
earlier study were included in the scales 
administered to students in the new 
sample. Only results for the 13 over- 
lapping press scales are shown in 
Table 1. 

It can be seen that the results con- 


firm and extend the findings in my 

®This sample was drawn from a larger 
sample of students participating in the 1959 
National Merit Scholarship Qualifying Test. 
The characteristics of the larger sample have 
been described elsewhere by Thistlethwaite 
(1960). 


earlier report. First, each of the four 
kinds of faculty press (affiliation, inde- 
pendence, vocationalism, and suppor- 
tiveness) previously associated with in- 
crease in LOA was also found to be 
significantly related in this analysis. The 
directions of the relationships are iden- 
tical in both studies. On one scale (sup- 
portiveness) the significance level is 
lower, but on two (affiliation and voca- 
tionalism) the significance level is 
higher, compared to the earlier results. 
Second, both analyses agree in failing 
to reject the null hypothesis for three 
scales (directiveness, social conformity, 
and competition). Third, at least three 
additional significant relationships ap- 
peared; increase in LOA was associated 
with strong faculty press for enthusi- 
asm, humanism, and achievement. Dif- 
ferences appearing on the remaining 
scales are ambiguous because of failure 
to meet the assumption of homogeneity 
of regression required for analysis of 
covariance. 

Clearly the previously reported asso- 
ciations with faculty press did not dis- 
appear when errors of recall were ruled 
out; on the contrary, they appeared 
again, and were accompanied by addi- 
tional faculty press associations. The 
fact that additional effects were sug- 
gested in this analysis could be due to 
a number of factors: increased relia- 
bility of the measures of intial LOA 
(as previously noted, an increase in the 
reliability of measurement should make 
it easier to discover effects), increased 
sample size, or the somewhat greater 
range of talent represented in this sam- 
ple. In any case, it seems clear that 
these results clearly rule out the first of 
the rival hypotheses proposed by Astin. 


APPROPRIATENESS OF COVARIANCE 
ANALYSIS 
Astin raises two questions concerning 
the appropriateness of the method of 
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analysis in my earlier report: First, 
were the assumptions of covariance anal- 
ysis satisfied? Second, was it proper 
to map the set of aspiration levels into 
an interval scale? 

Three assumptions are commonly 
cited as prerequisities for use of simple 
linear covariance adjustment: linearity 
of regression, homogeneity of regression, 
and homogeneity of variance in the 
residual errors of estimate. Tests were 
made of the tenability of the last two 
of these assumptions in the analysis re- 
ported earlier. Except where noted these 
assumptions were satisfied. Since the 
consequences of departure from linearity 
of regression should be to overestimate 
the residual errors of estimate, and 
probably to underestimate the F ratios, 
it was simply assumed that the rela- 
tionship between pretest and posttest 
LOA was linear. The heteroscedasticity 
noted by Astin probably means that the 
residual errors of estimate are not nor- 
mally distributed. Cochran (1947) and 
Lindquist (1953) have discussed the 
effects of nonnormality upon the F tests 
used in analysis of variance, and the 
related work presumably carries over— 
for instance, similar conclusions prob- 
ably apply for the effects of nonnor- 
mality in residual errors of estimate in 
covariance analysis (Cochran, 1957). 
The general import of this discussion is 
that the lack of normality does not pre- 
clude use of the F distribution, particu- 
larly if the tabled probabilities are 
doubled. Accordingly two-tailed tests 
of significance were used throughout in 
the earlier study, though most of the 
hypotheses were directional in nature. 

A simple, dogmatic answer cannot be 
given to the second of the above ques- 
tions. Some investigators prefer to map 
their data into an interval number sys- 
tem; others do not. Some justify map- 
ping into a strong system of analysis on 
the basis of pragmatic results; others 


maintain that we should let the data 
determine the level of measurement em- 
ployed. However, even if we grant that 
there are reasonable doubts about the 
appropriateness of mapping the LOA 
levels into an interval scale, Astin has 
not shown how such mapping could pro- 
duce spurious results. A simple way of 
determining whether the results are 
specific to the method of analysis 


adopted is to repeat the comparisons 
using a different method of analysis. 


RESULTS OBTAINED WITH AN ALTERNA- 
TIVE Mope or ANALYSIS 


Table 2 shows that the overall effects 
are very similar when we analyze the 
data by the alternative method used 
by Astin. In Table 2 mean press scores 
on each scale have been compared for 
subgroups which had the same pretest 
LOA but which differed in degree of 
change in LOA over a 2 year interval. 
To make these tests comparable to the 
overall F tests obtained in Table 1 (and 
in the earlier report) combined tests of 
significance were calculated by a method 
described by Mosteller and Bush (1954, 
p. 330) which permit one to weight the 
normal deviates in proportion to the 
sizes of the subgroups. 

Both Tables 1 and 2 show that in- 
crease in LOA was associated with: 
(a) strong faculty press for enthusi- 
asm, humanism, affiliation, independ- 
ence, achievement, and supportiveness; 
(6) weak faculty press for compliance; 
and (c) strong student press for 
estheticism. The second analysis did 
not confirm the association with faculty 
press for vocationalism or with student 
press for reflectiveness, but did suggest a 
significant association with faculty press 
for directiveness.* 


* Actually, the results on the scales for 
faculty press for vocationalism and student 
press for reflectiveness are confirmed if we use 
one-tailed tests of significance. 
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TABLE 2 


COMPARISON OF AVERAGE Press Scores oF MEN SHOWING DirrerENT CHANGES IN ASPIRATION 


Sign and value of ¢ for differences between means 


(mean for subjects having same or higher LOA 
in 1961 minus mean for subjects having 
lower LOA in 1961)* 


Bachelor’s 
(1) 


Press scale 


Faculty press 
Enthusiasm (+-)” 
Humanism (-+-) 
Affiliation (+-) 
Independence (-++-) +2.88** 
Compliance (— ) 6.35** 
Vocationalism ( — ) 95 
Achievement (+) +1.98* 
Supportiveness (-+-) +2.36* 
Directiveness (ns) 52 

Student Press 
Estheticism (+) 

Reflectiveness (+-) 
Social conformity (ms) 
Competition (ns) 


+2.32* 
+4.32** 
4+3.17** 


Master’s 
(2) (3) 


43.75** 
4+4.17** 
+3.32** 
+5.16** 
— 1.63 

+ 0S 

+2.47* 
+2.30* 
4+4.57%* 


+-3.58** 
+1.41 
+1.21 
+ .73 


Initial LOA in 1959 


Combined 
test of 
significance 


Doctorate 
(PhD or the 
equivalent) 

(4) (5) 


Professional 
degree (MD, 
DDS, etc.) 


+4.00** 
+6.50** 
+3.95%* 
+5.60** 
—6.40** 
—1.89 
4+3.13** 
+3.39%* 
+2.26* 


+1.67 — 17 
+4.19** 48 
+ 30 ll 
+2.57* 05 
—1.97* — 83 
— 2.45* 65 
+2.02* - .74 
+2.14* 69 
+1.68 2.73** 


+2.67** 
+1.78 
+ 38 
+1.85 


+2.54* 50 
+1.88 » £2 
+ 37 — 1,02 
+1.25 -4.31** 


® Signs in Column 1 are for the difference obtained by subtracting the mean for subjects having the same or 


lower LOA from the mean for subjects having higher LOA in 1961. 


Ns for the groups having higher LOA in 


Columns 1—4 were 341, 398, 375, and 235, respectively; Ns for groups having lower LOA in Columns 1-4 were 


553, 161, 141, and 168, respectively. 


> Direction of relationship between press and increase in LOA found in Table 1 


*p<.05. 
** p< 01. 

The individual ¢ values show that 
faculty and student press were virtually 
unrelated to the changes in LOA of stu- 
dents who entered college with plans to 
seek the PhD. On the other hand, 
faculty and student press were highly 
related to changes in aspirations of stu- 
dents who entered college with plans to 
seek the MD or lower-level degrees. 

It is clear that the effects reported 
in my earlier study were not specific to 
the method of analysis used; it may 
therefore be concluded that there is no 
evidence that those results were spuri- 
ous due to my use of covariance pro- 
cedures. 


SomME SAMPLING CONSIDERATIONS 


Finally, brief comment should be 
made on the statistical tests reported 
in Table 3 of Astin’s paper. His sub- 
groups have Ns of 6 and 14, at the 
bachelor’s level, and Ns of 12 and 11 
at the master’s level. Now we have just 
seen the effects of faculty press appear 
primarily among students who do not 
initially plan to seek the PhD or its 
equivalent. It is hardly surprising that 
the null hypothesis was not rejected with 
such small samples particularly at the 
lower LOA levels. My earlier paper spe- 
cifically called attention to the attenua- 
tion in the range of variation of the in- 





RivaL HypoTHeses REGARDING LEARNING ENVIRONMENTS 315 


dependent and dependent variables; it 
should also have been apparent from the 
epsilon values reported that I was deal- 
ing with relatively weak effects. Further- 
more, Astin made no attempt to obtain 
combined tests of significance (over 
levels), so that the results would be com- 
parable with the F ratios obtained in the 
covariance analysis. Consequently it is 
difficult to see how Astin could have ex- 
pected the null hypothesis to be rejected 
in these comparisons.’ In any case, his 
results are clearly less definitive than 
those which we have just considered in 
Table 2. 


DISCUSSION 


Since all of the studies so far reported 
on college productivity, or on changes 
in motivation for advanced training, 
have been based upon correlational 
data, there remains a reasonable doubt 
about the validity of amy causal hy- 
pothesis advanced as an explanation for 
the observed relationships. I am dis- 


posed to explain these relationships as 
being due to the kinds of demands, 
images, and pressures which teachers 
and peers create, while Astin is dis- 
posed to see the explanation in terms 


of precollege characteristics. We can 
settle controversies about specific rival 
hypotheses (as I believe we have in this 


5 A similar overreliance on small sample sta- 
tistics is revealed in Astin’s (1961) earlier 
paper, in which he has argued that precollege 
characteristics accounted for some of the rela- 
tionships between faculty press and college pro- 
ductivity. The analysis reported there entailed 
second-order partial correlations based upon Ns 
of 36. However, partial correlations are no- 
toriously affected by errors in measurement 
and little confidence can be placed in the 
partials when the basic r’s are so unreliable 


exchange), but there is of course no way 
of ruling out all possible conjectures 
about rival interpretations. Since the 
number of “input” variables is poten- 
tially infinite one can always interpret ef- 
fects in terms of new, uncontrolled in- 
put characteristics. In the hope of 
obtaining some leads about methods of 
making teaching more effective I am 
willing to risk the temporary acceptance 
of one or more false hypotheses. How- 
ever, nothing that Astin has presented 
convinces me he has yet identified any 
viable hypotheses that are genuine rivals 
to the hypothesis that learning environ- 
ments shape in some degree the stu- 
dent’s motivation for advanced training. 
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SEX, SOCIAL CLASS, AND ANXIETY AS SOURCES OF 
VARIATION IN SCHOOL ACHIEVEMENT * 


BEEMAN N. PHILLIPS 
University of Texas 


The hypothesis that social class interacts with sex and anxiety to produce 
differences in school achievement was tested utilizing a sample of 759 
adolescents classified into 8 subsamples involving 2 levels of anxiety and 
social class, and both sexes. 8 measures of school achievement were 
employed, including standardized tests and teacher grades. The results 
supported 2 major findings of previous research: females had higher 
anxiety scores than males, and highly anxious Ss had lower achievement 
and intelligence scores. Both 1st and 2nd order interactions between sex, 
social class, and anxiety were found. Difficulties were encountered in 
integrating these findings into anxiety theory, and certain proposals are 


offered to accomplish this. 





Despite the great number of studies 
which have been reported, it is still diffi- 
cult to reach definitive conclusions re- 
garding anxiety and its relationships 
with school attitudes, behavior, and 
achievement. One of the reasons for 


this is that sex differences in the con- 
sequences of anxiety frequently have 


been reported (e.g., McGuire, Hinds- 
man, King, & Jennings, 1961; Phillips, 
Hindsman, & McGuire, 1960; Sarason 
Davidson, Lighthall, Waite, & Ruebush, 
1960). In addition, it has been found 
that sex interacts with a variety of fac- 
tors in these differences in the conse- 
quences of anxiety (e.g., Grooms & 
Endler, 1960; Phillips, Hindsman, & 
Jennings, 1960; Reese, 1961). Finally, 
no theory of anxiety seems capable of 
explaining these multiple results. 

In this situation it appears that a 
further step which needs to be taken is 
the investigation of factors which have 
relationships with sex, school achieve- 


1 The research reported here was performed 
pursuant to OE-2-10-003 with the United 
States Office of Education, Department of 
Health, Education, and Welfare. Also, the 
author wishes to express his appreciation to 
G. McBee who supervised IBM 650 analyses 
of data. 


ment, and anxiety. One such factor is 
social class, which is clearly related to 
all three; and it was the purpose of this 
study to investigate social class, sex, 
and anxiety as interrelated factors in 
school achievement. The investigation 
was designed to test a general hypothe- 
sis,.which.was that differences in school 
achievement would be associated with 
interactions of sex, social class, and 
anxiety. In addition, the implications 
of such findings with regard to prevail- 
ing conceptualizations of anxiety, and 
its relationships with school achieve- 
ment, were analyzed. 


METHOD 


The 759 subjects of this study were selected 
from approximately 1,500 seventh grade stu- 
dents in four Texas communities who form the 
total population of the University of Texas 
Human Talent Project. The instruments used 
included the California Test of Mental Ma- 
turity (CTMM); the Language Arts, Mathe- 
matics, and Reading subtests of the California 
Achievement Test (CAT); the Social Studies 
and Science subtests of the Sequential Test of 
Educational Progress (STEP); and the Cas- 
taneda-McCandless Anxiety Scale, with items 
adapted for use with adolescents. An index of 
social class based on source of income, occupa- 
tion, and education of the parents also was 
computed for each subject (McGuire et al., 
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190i). In addition, teacher grades were ob- 
tained for the subject areas of language arts, 
mathematics, and social studies at the end of 
the school year. All of these variables were 
linearly transformed to stanine values so that 
each had a mean of 5 and a standard deviation 
of 1.5. 

Basically a fixed 2% 2X2 design 
utilized with two levels of social class 
anxiety, and two sexes. In order to achieve a 
clearer differentiation between subsamples, the 
subjects with stanines of 5 on social class or 
anxiety were not included in the analyses 
Therefore, the “high” levels of social class 
(middle class) and anxiety included subjects 
with stanine values of 6 or greater, and the 
“low” levels of social class (lower class) and 
anxiety included subjects with stanine values of 
4 or less. Thus, there were a total of 8 sub- 
samples drawn from the total population. An- 
other factor, type of achievement measure with 
two categories, teacher grades and objective 
tests, was added to the basic design. 

Two different approaches to school achieve- 
ment were employed. The first was based on 
the relationship between intelligence and 
achievement, which was determined by com- 
puting zero-order correlation coefficients be- 
tween total IQ and each of the eight achieve- 
ment scores, and then performing an analysis 
of variance of Fisher z transformations of these 
correlations. 

The other approach was based on the level 
of achievement reached, and for this means 
and standard deviations were computed on all 


was 
and 
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achievement variables, and ¢ tests and an 
analysis of variance were performed on these 
means 

In addition, chi square analyses were made 
of the distributions of high and low anxiety 
scores; intelligence means were computed and 
tested for significance; and differences between 
intelligence and achievement means were deter- 
mined 


RESULTS 


The analyses of the data are presented 
in the following order. The results with 
respect to the correlations are given in 
Tables 1 and 2. Table 1 gives the actual 
coefficients and Table 2 shows the re- 
sult of the analysis of variance of 
Fisher z transformations. Then in 
Table 3, the results with respect to the 
mean differences in achievement be- 
tween the subsamples are given, while 
Table 4 presents an analysis of variance 
of these means. Finally, Table 5 gives 
the IQ means and standard deviations 
of the subsamples; Table 6 presents a 
chi square analysis of the sex and so- 
cial class distribution of the high and 
low anxiety scores; and Table 7 shows 
differences between mean _ intelligence 
and mean achievement scores. 


TABLE 1 


° 
CorrELaTIoNs OF IQ with ACHIEVEMENT VARIABLES 


Males 


High social 
class 


Low 
anxiety 


High 
anxiety 


Achievement 
variables 


Teacher grades 
Social studies 
Mathematics 
Language arts 
Objective tests 
STEP Social Studies 
CAT Mathematics 
CAT Language Arts 
CAT Reading 
STEP Science 
N 


“Aawunw 
“uS&o 


Note.—Decimal points have been omitted 


High 
anxiety 


Females 


Low social 


Low social 
class 


High social 
class l 


ciass 


Low 
anxiety 


High 


anxiety 


Low 
anxiety 


High 
anxiety 


Low 
anxiety 


+ & « 


~ 
ow MUM w 
owe & ‘ 
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TABLE 2 


ANALYSIS OF VARIANCE OF FISHER z TRANSFORMATIONS OF CORRELATION COEFFICIENTS 


Source 

Sex 

Social class 

Anxiety 

Achievement 

Sex & social class 

Sex < anxiety 

Sex achievement 

Social class anxiety 

Social class achievement 

Anxiety achievement 

Sex X social class < anxiety 

Sex X social class & achievement 
Sex X anxiety achievement 

Social class * anxiety achievement 
Sex * social class anxiety achievement 
Within 


* p< 01 


Now, summarizing the results of 


Table 2, it may be stated that: 

1. Three main effects were statisti- 
cally significant: IQ was more highly 
correlated with achievement for males 
than for females, IQ was more highly 
correlated with achievement in the mid- 
dle class than in the lower class, and 


F 


Qa 
—~ 


75.09** 
96.17** 
244.39** 
81.65** 
17.09** 
2.30 
29.04** 


18.61** 


CO ee eee ee 


te 


IQ was more highly correlated with ob- 
jective tests than with teacher grades. 

2. Three first order interactions also 
were statistically significant: an increase 
in social class resulted in a higher cor- 
relation of IQ with achievement for 
males than for females, an increase in 
anxiety resulted in an increase in the 
correlation of intelligence with achieve- 


TABLE 3 


SUBSAMPLE MEANS AND STANDARD DEVIATIONS FOR THE ACHIEVEMENT VARIABLES 


Males 


High social 
class 


High 
anxiety 


Low High 
anxiety 


Achievement 7 _ 


variables zm zw 2 


Teacher grades 
Social studies 1 
Mathematics 8 1 
Language arts l 
Objective tests 
STEP Social 5 1.6 
Studies 
CAT Mathematics s&s 
CAT Language 5.1 1 
» © 
6 1 


1 
l 
l 


Now 


s 


CAT Reading 


1 
l 
1 
STEP Science 1 


sw 


1 
1 
1 
1 


* Difference between this high 
umn) statistically significant at or beyond the .05 level 


Low sox ial 
class 


anxiety 


SD 


4 
6 
3 


i.3 


5 
4 
5 


anxiety subsample mean and the low anxiety subsample 


Females 


Low social 
class 


High social 
Class 


Low 
anxiety 


High 
anxiety 


Low 
anxiety 


High 
anxiety 


Low 
anxiety 


X SD SD SD SD X SD 


1.3 1 
1.3 l 
1.¢ l 


11 ¢ 1 
1.2 
1.4 5.3 
13060 —6S.8° 
1.3 


1.4 
1.4 
1.3 
5.4° 1.1 


1.1 
12 
1.1 
14 


mean (in the next 





Factors IN SCHOOL ACHIEVEMENT 319 


TABLE 4 


ANALYSIS OF VARIANCE OF ACHIEVEMENT 
VARIABLE MEANS 


Source 


Sex 

Social class 

Anxiety 

Sex X social class 

Sex X anxiety 

Social class X anxiety 

Sex < social class anxiety 
Within* 


* Based on the eight eight 
mean achievement scores 


as 


subsamples the 


TABLE 5 


SUBSAMPLE IQ MEANS AND STANDARD 
DEVIATIONS 


Subsample 


Male 
High social class, high anxiety 
High social class, low anxiety 
Low social class, high anxiety 
Low social class, low anxiety 
Female 
High social class, high anxiety 
High social class, low anxiety 
Low social class, high anxiety 
Low social class, low anxiety 


* Difference between this mean and the mean directly 
below it statistically significant at beyond the .05 
evel 


TABLE 6 


Cur Square ANALYSES OF DISTRIBUTION OF 
Anxiety Scores spy Sex AND SociAL CLAss 


Anxiety 


Subsample High Low Chisquare 
High -social class 
Males 
Females 
Low social class 
Males 
Females 
Males 
High social class 
Low social class 
Females 
High social class 
Low social class 


01; df=1 


ment for males but a decrease in this 
correlation for females, and an increase 
in social class resulted in a much higher 
increase in the correlation of IQ with 
objective tests. 

3. Finally, two second order inter- 
actions were statistically significant: 
for males, an increase in social class re- 
sulted in a greater increase in the cor- 
relation of IQ and objective tests, while 
for females, an increase in social class 
resulted in an increase in the correla- 
tion of IQ and teacher grades, but there 
was no change in the correlation of 
IQ and objective tests; and for males, 
an increase in anxiety resulted in an 
increase in the correlations of IQ and 
teacher grades, and IQ and objective 
tests, while for females, an increase in 
anxiety resulted in an increase in the 


TABLE 7 


SUBSAMPLE DIFFERENCES BETWEEN Mean IQ 
AND THE MEAN OF THE ACHIEVEMENT MEANS 


Teacher 
grades * 


Objective 
Subsample test ° 
Male 
High social class, 
high anxiety 
High social class, 
low anxiety 
Low social class 
high anxiety 
Low social class 
low anxiety 
Female 
High social class, 
high anxiety 
High social class, 
low anxiety 
Low social class, 
high anxiety 
Low social class, 
low anxiety 


* Difference obtained by determining the mean of 
the three teacher grade means and subtracting the mean 
1O 

> Difference obtained by determining the mean of 
the five objective test means and substracting the mean 
1Q 

* Difference is statistically significant 
the .05 level 


at or beyond 
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correlation of IQ and teacher grades 
but a decrease in the correlation of IQ 
and objective tests. 

Referring, now, to Table 4, the fol- 
lowing results may be noted: 

1. Three main effects were statisti- 
cally significant: females had higher 
achievement than males; middle-class 
subjects had higher achievement than 
lower-class subjects; and subjects with 
low anxiety had higher achievement 
than subjects with high anxiety 

2. Three interactions were statisti- 
cally significant: an increase in anxiety 
resulted in lower achievement for fe- 
males and slightly higher achievement 
for males; an increase in anxiety re- 
sulted in lower achievement for middle- 
class subjects but had no appreciable 
effect on the achievement of lower-class 
subjects; and an increase in anxiety re- 
sulted in an increase in the achievement 
of lower-class males and a decrease in 
the achievement of lower-class females, 
while an increase in anxiety resulted in 
a larger decrease in the achievement 
of middle-class females than it did in 
middle-class males. 

Next, in reference to Tables 5 and 6, 
two major findings may be reported: 
first, an increase in anxiety among fe- 
males resulted in lower intelligence 
scores in both social classes, while an 
increase in anxiety among males resulted 
in higher intelligence scores in the lower 
class, and made little difference in the 
middle class; second, there were pro- 
portionately more males and females 
with high anxiety in the lower class than 
in the middle class, although in the 
lower class a larger proportion of fe- 
males than males had high anxiety. 

Finally, from the results of Table 7, 
it was found that: for males in the mid- 
dle class, mean teacher grades were 
lower than mean intelligence in both 
high and low anxiety groups; for males 


in the lower class, mean teacher grades 
were lower than mean intelligence in 
only the low anxiety group; for females 
in the middle class, there were no differ- 
ences between mean teacher grades and 
mean intelligence in either high or low 
anxiety groups; for females in the lower 
class, mean teacher grades were higher 
than mean intelligence in the low anx- 
iety group, with no mean difference in 
the high anxiety group; and lastly, dif- 
ferences between standardized achieve- 
ment and intelligence were insignificant . 
in all subsamples. 


DISCUSSION 


Viewed in relation to the overall find- 
ings of past anxiety research, it is evi- 
dent that the general results of this 
study were in agreement on two major 
points. Sex differences in level of anx- 
iety were obtaine“, with males having 
lower anxiety scores than females; and 
differences in the achievement of subjects 
with high and low anxiety were found, 
with high anxiety being associated with 
lower achievement. However, the signifi- 
cance of these areas of agreement is 
diminished by the finding that both the 
level of anxiety and its relationship to 
school achievement depended on the in- 
teraction of sex and social class. 

As the first indication of this inter- 
action, sex differences in level of anxiety 
occurred only in the lower social class, 
where females had significantly higher 
anxiety scores than males. In addition 
to this, it should be noted that middle- 
class and lower-class males differed less 
in anxiety scores than middle-class and 
lower-class females. 

As a further indication of the inter- 
action of sex and social class, lower-class 
males with high anxiety had higher in- 
telligence scores than lower-class males 
with low anxiety, although middle-class 
males with high anxiety did not differ 
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significantly in intelligence from middle- 
class males with low anxiety. Further- 
more, in both social class levels females 
with high anxiety had lower intelligence 
scores than females with low anxiety. 

Finally, the interaction between sex 
and social class was evident in relation- 
ships between anxiety and _ school 
achievement. In general, anxiety as a 
debilitating factor was more evident in 
relation to teacher grades than in rela- 
tion to achievement tests, with the 
achievement of lower-class males being 
less affected than the achievement of 
other adolescents. 

Considering what these findings may 
mean in terms of anxiety theory, Sara- 
son, Davidson, Lighthall, and Waite 
(1958) have speculated that males ob- 
tain lower anxiety scores than females 
because they are more reluctant to ad- 
mit anxiety. As they stated it, anxiety 
is “ego-alien” to males and “ego-syn- 
tonic” to females. In other words, the 


expression of anxiety is incongruous to 
the male’s conception of himself, and 


this causes him to be more defensive 
about admitting anxiety. Accepting this, 
the results of this study may be ex- 
plained by assuming that social class is 
a factor in whether anxiety becomes 
ego-alien or ego-syntonic. More spe- 
cifically, the greater differentiation of 
sex roles in the lower class, particularly 
with respect to masculinity, causes the 
ego-alien aspects of anxiety to be more 
highly developed and delineated in 
lower-class males. 

Although sex and social-class differ- 
ences in level of anxiety seem to fit into 
the ego-alien and ego-syntonic rationale, 
more difficulty is experienced in trying 
to explain the role of sex and social 
class in relationships between anxiety 
and school achievement. According to 
the rationale, anxiety should have greater 
disruptive effects on males, due to the 
greater ego-alien nature of anxiety in 


males. But, as previously noted, anxiety 
appeared to interfere less with achieve- 
ment of males than females, especially 
in the lower class. Thus, whether anx- 
iety produces responses which are com- 
patible or incompatible with school 
achievement tasks seems to depend upon 
social class. More to the point, the 
social class related experiences of lower- 
class males appear to be especially sig- 
nificant in whether anxiety has facilita- 
tive effects. 

Finally, investigators espousing drive 
conceptions of anxiety have been par- 
ticularly interested in the performance 
of highly anxious and less anxious sub- 
jects on complex and simple tasks. Ac- 
cording to the Hull-Spence drive theory, 
anxiety should interfere with perform- 
ance on complex tasks, and if tasks on 
intelligence tests can be considered com- 
plex, the negative relationships usually 
obtained between anxiety and _ intelli- 
gence appear to be consistent with this 
view. However, the results of this study 
did not altogether conform to the pre- 
diction of drive theory since lower-class 
males with high anxiety had higher in- 
telligence scores than lower-class males 
with low anxiety. Apparently, then, the 
relationship between anxiety and intelli- 
gence is not uniformly negative across 
samples differentiated in terms of sex 
and social class, which raises the ques- 
tion of whether these factors in some 
way influence the drive properties of 
anxiety. 

In summary, it is apparent that sex 
differences are not easily integrated into 
prevailing theories about anxiety, and 
that the interaction of sex, social class, 
and anxiety represents a particular chal- 
lenge to existing theoretical formula- 
tions. Furthermore, it is argued that 
an integration of research findings con- 
cerning anxiety and ego involvement, 
and social class and school achievement, 
offers the best opportunity for a con- 
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ceptualization which will successfully 
handle such differences. Toward this 
end, the following facts and specula- 
tions are presented: 

1. It is known that middle-class sub- 
jects have a greater desire to do well 
in school than lower-class subjects, and 
that a similar difference exists between 
females and males, with this difference 
being greater in the lower class. As a 
result, school achievement tasks should 
be more ego involving to middle-class 
subjects than to lower-class subjects, 
and to females more than males, espe- 
cially in the lower class. 

2. There have been studies of the re- 
lationship between anxiety and per- 
formance under neutral and ego involv- 
ing conditions, and the gist of these 
studies is that highly anxious subjects 
perform better under neutral conditions, 
while less anxious subjects perform 
better under ego involving conditions. 
Therefore, the degree to which anxiety 
interferes with a subject’s performance 
in a situation depends upon the degree to 
which the subject is ego involved. 

3. Integrating these findings, since 
school achievement tasks are more ego 
involving to middle-class subjects, and 
to females more than males (particu- 
larly in the lower class), the degree to 
which anxiety interferes with school 


achievement should depend upon both 
the sex and the social class status of 
subjects. Taking into account these spe- 
cific relationships, anxiety should inter- 
fere the least with the school achieve- 
ment of lower-class males, and this is 
what the overall findings of this study 
showed. 
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